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It is sometimes pointed out as a curiosity that the state space of quantum two-level systems, i.e. 
the qubit, and actual physical space are both three-dimensional and Euclidean. In this paper, we 
suggest an information-theoretic analysis of this relationship, by proving a particular mathematical 
result: suppose that physics takes place in d spatial dimensions, and that some events happen prob- 
abilistically (not assuming quantum theory in any way). Furthermore, suppose there are systems 
that carry "minimal amounts of direction information", interacting via some continuous reversible 
time evolution. We prove that this uniquely determines spatial dimension d = 3 and quantum theory 
on two qubits (including entanglement and unitary time evolution), and that it allows observers to 
infer local spatial geometry from probability measurements. 



I. INTRODUCTION 

The fact that the state space of quantum two-level 
systems - the Bloch ball - and physical space are both 
three-dimensional and Euclidean has been regarded as a 
remarkable coincidence for many years, provoking inter- 
esting ideas and lines of research. Building on this obser- 
vation, von Weizsacker [TJ [2] constructed his "ur theory" 
as an attempt to derive spacetime from quantum me- 
chanics. Similarly, Penrose's twistor theory [3] was built 
on the idea that the geometry of physical and quantum 
state space are fundamentally related, which was elabo- 
rated further by Wootters [3] pointing out the relation 
between quantum state distinguishability and geometry. 

The idea that the quantum bit state space and physi- 
cal space are somehow logically intertwined has become 
a widespread paradigm, cf. [Hj . But what is the exact re- 
lationship - which one of the two determines the other? 
Could a similarly nice relationship also exist in other di- 
mensions, or is there something special about d = 3? 

The goal of this paper is to offer a particular 
information-theoretic analysis of these questions: we 
show that a certain natural interplay between geometry 
and probability is only possible if space has three dimen- 
sions, and if outcome probabilities of measurements are 
exactly as predicted by quantum theory. This result sug- 
gests to explore the idea that neither quantum theory 
nor spacetime are separately fundamental, but that both 
might have a common information-theoretic origin. 

Our approach rests on some natural background as- 
sumptions. Suppose that physics takes place in d spatial 
dimensions (and one time dimension), and some of the 
physical processes involve probabilities. That is, there 
exist experiments with random outcomes - we can imag- 
ine that physicists, or nature, prepare physical systems 
in certain states, and later on, measurements on the sys- 
tems reveal outcomes with certain probabilities. We do 
not assume that those probabilities are necessarily de- 



scribed by quantum theory. 

Then we consider the situation depicted in Fig. [I] we 
have two agents, traditionally called Alice and Bob. Al- 
ice's goal is to send some spatial direction - that is, a unit 
vector x e R d - to Bob, by encoding it into some state 
ui(x) and sending a physical system that carries this state. 
We assume that they do not share a common coordinate 
system, such that Alice cannot simply send a classical de- 
scription of x. Bob holds a measurement device that can 
be rotated in space, which he can apply to the state that 
he received, obtaining one of finitely many possible out- 
comes. The outcome probabilities depend continuously 
on the device's spatial orientation. Furthermore, suppose 
that the following four postulates are satisfied: 

1. Alice can encode any spatial direction x € K d into 
some state such that Bob is able to retrieve x in 
the limit of many copies. 

2. It is impossible for Alice to encode any further in- 
formation into the state without adding noise to 
the direction information. 

3. There is a unique way to add up single-system ob- 
servables on pairs of systems. 

4. The state-carrying systems can interact pairwise by 
continuous reversible time evolution. 

As we show below, these postulates can only be satisfied 
if d = 3 and if these systems, and pairs of them, are 
described by quantum theory. That is, we derive the 
three-dimensionality of space, two-qubit quantum state 
space and unitary time evolution as the unique solution. 

These postulates declare some actions as possible or 
impossible: it is possible to let two systems interact, but 
impossible to encode more than a spatial direction into 
one system (we define below what this means in detail). 
This approach is in line with other recent developments 
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FIG. 1: The situation considered in this paper. Bob holds a macroscopic measurement device that he can rotate in d-dimensional 
space; its orientation in space is thus described by a unit vector ("direction") y g R d . Alice's goal is to send a spatial direction 
x £ K d , \x\ = 1, to Bob, by encoding it into a suitable state oj(x). After obtaining the state, Bob measures it with his device, 
obtaining one of several possible measurement outcomes with some probability (indicated by a flashing lightbulb in the picture). 
After obtaining many identical copies of uj(x), and measuring it in many different directions y £ R d , Bob is supposed to estimate 
Alice's direction x, such that his guess becomes arbitrarily close to Alice's actual choice in the limit of infinitely many copies. 
We assume that Alice and Bob have agreed on an arbitrary protocol beforehand, but they do not share a common coordinate 
system, such that Alice cannot simply send a classical description of x. 



like information causality [6], where postulates of im- 
possibility of certain information-theoretic tasks are ex- 
ploited to derive properties of physical theories. These 
approaches also have successful historical examples, like 
the postulate of impossibility to build a perpetuum mo- 
bile of the second kind in thermodynamics. 

The approach in this paper may be interpreted as the 
application of novel mathematical tools to the old ques- 
tion of the relation between geometry and probability. 
These tools have their origin in the recent wave of ax- 
iomatizations of quantum theory [7U12). in particular in 
Hardy's seminal work |8j, and are inspired by recent work 
on quantum reference frames [TBHT7] . 

The first part of this paper consists of an introduction 
to one of these tools, which is the framework of convex 
state spaces, generalizing quantum theory in a natural 
way. Then, the first two postulates will be defined in 
more detail, and will be used to derive the state space of a 
single system. Finally, joint state spaces and the remain- 
ing two postulates will be discussed in detail, yielding our 
main result. Throughout the paper, only the main ideas 
and proof sketches are given; the full proofs are deferred 
to the appendix. 



II. SETTING THE STAGE: CONVEX STATE 
SPACES 

The framework of convex state spaces - also called gen- 
eral probabilistic theories - has proven useful in the con- 
text of quantum information theory [8, 18-23J, but dates 
back much further 26 30 . We now give a brief introduc- 
tion; other useful introductory sources include [3"TH3~i] , in 
particular Chapters 1 and 2 in the paper by Mielnik [35] . 

Consider the simple physical setup in Fig. [2j We have 
a preparation device, which, whenever it is operated, gen- 
erates an instance of a physical system (for example, a 
particle). We assume that we can operate the prepara- 
tion device as often as we want (say, by pressing a button 
on the device, or by waiting until a periodic physical pro- 
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FIG. 2: Schematic of the physical setup underlying the frame- 
work of convex state spaces. 



cess has completed another cycle). In the end, the sys- 
tem can be measured, by applying one of several possible 
measurement devices with a finite number of outcomes. 

The intuition is that the device prepares the system in 
a certain fixed state oj; operating the preparation device 
several times produces many independent copies of uj. To 
define exactly what we mean by that, consider any fixed 
measurement device M.. If M. is applied to the prepa- 
ration device's output, we assume that we get one of k 
different measurement outcomes probabilistically, where 
k 6 N is an arbitrary natural number (in Fig. [2j we have 
k = 2, represented by the two dots). The probability to 
obtain the i-th outcome (where 1 < i < k) is denoted 
M (i Huj), such that Ei^ (i) M = !• 

Suppose we have two devices, both preparing the same 
type of physical system, but in two different states (f and 
uj. Then we can use them to build a new device that 
performs a random preparation: it prepares state uj with 
probability p, and state tp with probability 1 — p. The 
resulting state will be denoted puj + (1 — p)ip. This is a 
convex combination of uj and (p. If we apply measure- 
ment Ai to that state, we will get the i-th outcome with 
probability 

M® (poj + (1 - p)<p) =pM (l \uj) + {l-p)M {l) {ip) 

by the basic laws of probability theory. In summary, we 
see that states uj of some physical system are elements of 
a real affine space which we denote by some capital let- 
ter, say A; single measurement outcomes are described 
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by affine maps M.W : A — >• M. which yield values between 
and 1 for every state. Maps of this kind will be called 
effects. Full measurements are described by a collection 
of effects {A4^}f =1 that sum to unity if applied to any 
state. The set of all possible states of the correspond- 
ing physical system will be denoted CIa, the state space. 
It is a bounded subset of A. We have just seen that 
uj G CIa and tp G CIa imply that pu> + (1 — p)ip G CI a 
for all < p < 1; this means that CI a is convex. We 
will only consider finite-dimensional state spaces in this 
paper. Since outcome probabilities can only ever be de- 
termined to finite precision, we may (and will) assume 
that CIa is topologically closed. 

As a simple example, consider a physical system which 
resembles a classical bit, or coin. We can perform a mea- 
surement by looking whether the coin shows heads or 
tails; think of a two-outcome device which yields the first 
outcome if the coin shows heads, and the second other- 
wise. The possible states are then characterized by the 
probability p G [0, 1] of obtaining heads. The state space 
becomes a line segment, with all states being probabilis- 
tic mixtures of two pure states that yield either heads or 
tails deterministically, see Fig. [3^,). 




FIG. 3: Examples of convex state spaces: a) is a classical 
bit, b) and c) are classical 3- and 4-level systems, d) is a 
quantum bit, e), f) and g) are neither classical nor quantum, 
even though e) can naturally be embedded in a qubit. Note 
that quantum n-level systems for n > 3 are not balls [36] . 



The state spaces of a classical three- and four-level sys- 
tem are also shown in Fig. [3j b) and c): they are an 
equilateral triangle, resp. a tetrahedron. In general, the 
state space of a classical n-level system is the set of all 
probability distributions (p%, . . . ,p n ), which is an (n— 1)- 
dimensional simplex. 

Quantum state spaces look quite different. Quantum 
bits, the states of spin-1/2 particles, can be described 
by 2 x 2 complex density matrices p. These can al- 
ways be written in the form p = (1 + r ■ <?)/2, where 
f is an ordinary real vector in R 3 with \r\ < 1, and 
a = (o- x ,o- y ,a z ) denotes the Pauli matrices [37]. We can 
consider r = (r x ,r y ,r z ) as the state of the qubit. Thus, 
the state space is a three-dimensional unit ball as shown 
in Fig. Pp). A spin measurement in the z-direction may 



be described by the two effects M^{r) = (1 + r z )/2 
and MS 2 >(f) = (I — r z )/2, for example, where the two 
outcomes correspond to "spin up" and "spin down" , re- 
spectively. 

However, the state space of a quantum ndevel system is 
only a ball for n — 2; for n > 3, quantum state spaces are 
not balls, but intricate compact convex sets of dimension 

n 2 - 1 [3HU3I]. 

Given any state space CIa, all effects, i.e. affine maps 
M : A — > E with M{uj) G [0, 1] describe outcomes of con- 
ceivable measurement devices. We can work out the set 
of these maps from a description of CIa- In general, some 
of these measurements might be physically impossible to 
implement; in order to describe a physical system, we 
have to specify which ones are possible and which ones 
are not. 

From the effects, we can construct expectation values 
of observables, simply called observables in the following. 
These are arbitrary affine maps h : A — > K; in quantum 
theory, they are maps of the form p i— > tr(pH), where 
H = W is any self-adjoint matrix. One way to measure 
an observable (on many copies of a state) is to write it as a 
linear combination of effects, h = J^i hiM.i, hi G M, and 
to measure the effects M.i on different copies (in general, 
they may not be jointly measurable on a single copy and 
thus be outcomes of different measurement devices). 

Similarly, we can describe reversible transformations 
of a physical system: these are physical processes that 
take a state to another state, and may be inverted by 
another physical process (in quantum theory, these are 
the unitaries, mapping p to UpU'). Since they must 
respect probabilistic mixtures, they must also be affine 
maps. Due to reversibility, they map the state space 
CIa onto itself - they are symmetries of the state space. 
The set of reversible transformations on A is a closed 
subgroup Qa of all symmetries of CIa- 



III. SINGLE SYSTEMS: POSTULATES 1 AND 2 

We consider a particular situation where measure- 
ments take place in d-dimensional space, with one time 
dimension. For simplicity, we assume that there is a fixed 
flat background space, such that there is a unique way 
to transport vectors from one laboratory A to another 
distant laboratory B (however, we think that our results 
may apply to more general situations). We will also as- 
sume that all physical operations considered in the fol- 
lowing, such as measurements, are performed locally in a 
way such that all parties (particles, measurement devices 
etc.) are relative to each other at rest [75]. Thus, we do 
not have to consider conceivable relativistic effects. 

In general, there may be many different kinds of phys- 
ical systems described by convex state spaces. We now 
assume that there exists a particular type of physical 
system which, in a sense to be made precise, behaves 
like a "unit of direction information" . We will call these 
systems "direction bits" (later on, we show that they are 
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effectively 2-level systems, therefore "bits" , cf. Lemma 19 
in the appendix). We will not specify by what type of 
physical object they are carried - a direction bit could, for 
example, correspond to the internal degrees of freedom of 
a particle, or it could be something completely different. 
We will only assume that a direction bit may come in dif- 
ferent states (matching the framework described above), 
with a state space denoted fid- 
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FIG. 4: We assume that direction bits can be measured by 
some macroscopic measurement device, which yields one of 
several outcomes i £ {1, . . . , k} probabilistically. Due to sym- 
metry, its modus operandi depends only on a vector y £ K , 
\y\ = 1 specifying its "direction" in the local laboratory frame. 
The probability My (to) to obtain the i-th outcome depends 
only on the direction bit state cj and continuously on the di- 
rection y. The device can be rotated in space according to 
any rotation R € SO(d). In the rotated reference frame of 
the device, this corresponds to a reversible transformation on 
the direction bit. 



We assume that direction bits can be measured by a 
certain type of measurement device with a finite number 
of outcomes. As shown in Fig. [4j we imagine that the 
device is implemented as a macroscopic, massive object 
which can be rotated arbitrarily, i.e. can be subjected to 
any SO(d) rotation. Due to some symmetry of the de- 
vice, its orientation in space (locally in the lab) may be 
described by a unit vector y € R d , \y\ = 1, choosing some 
arbitrary but fixed coordinate system in the local labo- 
ratory. Instead of naively thinking of the whole device 
as "pointing in direction y" , we may also think that one 
of the device's components is a vectorial physical quan- 
tity which determines the type of measurement that is 
performed. A standard example in three dimensions is 
given by a Stern-Gerlach device, where y is the direction 
of inhomogeneity of a magnetic field. 

The case d = 1 is special, because 50(1) = {1} is 
trivial, and thus no one-dimensional rotation can map 
the unit vector +1 € M. 1 to the unit vector -1 e R 1 . In 
order to allow Bob to collimate his device in all directions 
also in d = 1, we will thus silently replace 50(1) by 
O(l) = {1, -1} in all of the following. 



Since the measurement which is performed by the de- 
vice may depend on its direction y in space, it is denoted 
My. In the following, by a "direction", we shall always 
mean a unit vector in M. d . For obvious physical reasons, 
we assume that the outcome probabilities My (us) are 
continuous in the direction y. 

Physically, we assume that we can perform a rotation 
R € SO(d) to the measurement device without touch- 
ing the direction bit; this transforms M. v to Mr v , but 
leaves the bit's state ui invariant. The fact that the out- 
come probabilities are altered, from Ady(ui) to M^ y (ui), 
should be understood as a result of the change in the 
relative orientation of the bit and the device. Thus, 
even though direction bits are considered as informa- 
tional "black boxes" with arbitrary physical realization, 
we are forced to adopt the interpretation that direction 
bits carry actual physical geometrical orientation. 

This enforces a certain duality that is familiar from 
quantum mechanics. Suppose that, after rotating the 
measurement device by R, we do not perform the mea- 
surement, but instead rotate the joint system of direction 
bit and measurement device back by R . If it is phys- 
ically unclear how to do this in practice, we can just 
imagine performing a passive coordinate transformation. 

Since this transformation does not change the relative 
direction of the system and measurement apparatus, it 
does not alter the outcome probabilities. However, by 
changing to the new coordinate system, Mr v has been 
transformed back to M y , hence the direction bit state 
must have changed from u> to some other state u>' such 



thatA^y^(w') = M.fi' y (uj). The state transformation 
ui' can be physically undone (by rotating the joint system 
again by R), hence it must be an element of the group 
of reversible transformations on fid- We call it Gr-i, 
such that we can switch from the "Heisenberg" to the 
"Schrodinger" picture via 

M { g{u>)=Mf{G R -,u). 

Clearly Gr oG s = Grs', in other words, the map R t— > 
Gr is a group representation of SO(d) on the direction 
bit state space. 

Now suppose we have a situation where two agents (Al- 
ice and Bob) reside in distant laboratories as depicted in 
Fig. [T] Imagine that Alice holds an actual physical vector 
x € M. d ( all vectors and rotations will be denoted with re- 
spect to Alice's local coordinate system in the following), 
and she would like to send this geometric information 
to Bob. Since Alice and Bob have never met, they have 
never agreed on a common coordinate system. Thus, it is 
useless for Bob if Alice sends him a classical description 
of x, because he does not know what coordinate system 
the description is referring to. 

However, if Bob holds a measurement device as in 
Fig. gj Alice can send him a direction bit in some state 
lj. As usual in information theory (taking into account 
the statistical definition of states), we analyze the prop- 
erties of a single state u> by considering many identical 
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copies of that state. So suppose Alice sends many in- 
dependent copies of oj to Bob. On every copy, Bob can 
measure in a different direction, and he may find that 
some outcome probabilities are varying over the differ- 
ent directions y G M. d , \y\ = 1. This breaks rotational 
symmetry, and so may be used by Alice to send physical 
direction information to Bob. 

However, Alice cannot send information about the 
length of the vector x to Bob, if we assume that Bob 
can only rotate his device (as in Fig. Bland not more. 
Thus, restricting to the situation in Fig. [lj we state that 
Alice's task is to send a direction vector x G R d , \x\ = 1, 
to Bob, by encoding it into some state. 

Postulate 1 (Encoding). There is a protocol (as in 
Fig. [T]) which allows Alice to encode all spatial directions 
x G R d , \x\ = 1, into states lj(x) G fid, such that Bob is 
able to retrieve x in the limit of many copies. 



Denote the coordinates of some vector x G K in Bob's 
local coordinate system by xb ■ Then we stipulate that af- 
ter obtaining n copies of lj(x), Bob makes a guess x^ of 
xb (based on his previous measurement outcomes) such 
that x$ — > xb for n — > oo with probability one. For ob- 
vious physical reasons, we assume that Alice's encoding 
x i — ^ lj(x) is continuous [73]. Moreover, Bob measures 
each direction bit individually and only once (we may 
imagine that direction bits get destroyed upon measure- 
ment [71]). 

In principle, direction bits might carry further addi- 
tional information that can be read out in measurements. 
As a naive example, the physical system that Alice trans- 
mits could be a simple wristwatch, with the watch hand 
pointing in the direction that Alice is intending to send. 
However, a wristwatch is hardly "economical" for this 
task: it carries a large amount of additional information, 
like the details of its head shape etc. Our second postu- 
late says that direction bits should be "minimal" in their 
ability to carry directional information: any attempt to 
encode further information can only succeed at the ex- 
pense of loosing some of the directional information. 

Postulate 2 (Minimality). No protocol allows Alice 
to encode any further information into the state without 
adding noise to the directional information. 

To spell out the mathematical details, we need to de- 
fine what it means that one state ip is noisier in its di- 
rectional information than another state lj. First, by 
directional information of ip, we mean the probability 
functions M^\lp) as seen by direction bit measurement 
device. If we have two states (p, lj with directional prob- 
abilities related by a rotation, i.e. M^iip) = M^ z (lj) 
for some rotation R G SO(d) and all i, we argue that 
both states are equally noisy in this respect - they both 



contain the same "amount of asymmetry" , just pointing 
in different directions. 

We could additionally say that tp and lj are equally 
noisy if H{ip) = H(lj) for some entropy measure H ; how- 
ever, there is no unique entropy definition for arbitrary 
state spaces |4IH43) , and entropy measures acquire mean- 
ing only relative to certain operationally defined tasks 
which is a complication we want to avoid. Therefore, we 
define ip to be at least as noisy in its directional infor- 
mation as lj if its directional probabilities are statistical 
mixtures of those of uj and other states that are equally 
noisy as lj; that is, if there are statistical weights Xj > 0, 
J2j A? = 1' an d rotations Rj G SO(d) such that for all 
outcomes i, 

3 

Clearly, ip is noisier than lj in its directional information 
if it is at least as noisy, and at the same time not equally 
noisy as lj. In Definition [8] and following in Appendix [A] 
we show that this notion is a natural generalization of 
the majorization relation [37) from classical probability 
theory and quantum theory. It also has a natural in- 
terpretation in terms of resource theories [141 138] : for 
any given lj, the probability functions z > M^z\lj) - 
or rather their directional asymmetry - constitute a re- 
source for Bob. One resource is less useful - that is, more 
noisy - than the other if it can be obtained from the other 
by "free" operations; in this case, by tossing a coin and 
performing a random rotation. 

Suppose we had a protocol that satisfied Postulate 1, 
and two states ip =^ lj that would work as a possible 
encoding of some direction x, in the sense that Bob would 
in both cases decode direction x in the limit of obtaining 
infinitely many copies. Then, by choosing to send either 
ip or lj, Alice could send an additional classical bit to 
Bob. Postulate 2 says that this is only possible at the 
expense of adding noise - that is, one of the two states 
must be noisier in its directional information than the 
other. 

Our goal is to determine the shape of the convex state 
space of a direction bit, using only Postulates I and 2 
and the physical background assumptions (Postulates 3 
and 4 will be considered later) . To this end, suppose Alice 
encodes some direction x according to some protocol into 
a state lj{x) and sends many copies of it to Bob. If the 
protocol satisfies Postulates 1 and 2, Bob will be able to 
decode x. 

Now suppose that Bob secretly performed a rotation 
R G SO(d) on his laboratory before the protocol started. 
Since the protocol must work regardless of the relative 
orientation of Alice and Bob, Bob will still succeed to 
obtain an accurate estimate of x as before. 

As we have seen, applying R to a measurement device 
can be replaced by applying Gr-i to the direction bit 
state. Therefore, the following implementation will also 
allow Bob to guess x: 
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• Apply Gr-i to every incoming direction bit; mea- 
sure as in the protocol above. 

• After obtaining n copies, determine the guess x^ 
given by the protocol above. 

• To compensate for the missing lab rotation, output 
the guess 

Suppose that R is in the stabilizer subgroup of x, i.e. 
Rx = x. Then the lines above prove that the original 
protocol also works if Alice sends the state G r -i(oj(x)) 
to Bob instead of uj(x). But these states are equally noisy 
in their directional information, hence Postulate 2 implies 
that they are equal. In other words, we have shown the 
following: 

For any encoding x <— > ui(x), the state oj(x) is invariant 
with respect to all rotations that leave x invariant. 

For what follows, fix x := (1,0,..., 0) T and an arbi- 
trary protocol that satisfies Postulates 1 and 2, yielding 
a state ui{x) that encodes direction x. If Rx = x, then 

M®(w(x)) = M®{G R -iu(x)) = M%(u(x)) for all y. 

Thus, for every i, the probability My (us(x)) is a func- 
tion of the first component of y. As we show in Lemma |T2| 
in Appendix [AJ this has the following consequence: there 
is at least one measurement outcome (call it io) and one 
direction y such that M { y o) {u)(x)) > M^J (u(x)) - oth- 
erwise, the state oj{x) would be "too symmetric" to allow 
the transmission of direction information, and Postulate 
1 would be violated. 

Fix this io. For every direction y that satisfies the 
inequality above (there might be more than one), we de- 
fine a new state uj'(y) by averaging over the stabilizer 
group [H] of y: 



G r uj{x)gIR. 



(2) 



R£SO(d):Ry=y 



From now on, we are only interested in the outcome io, 
and use the abbreviation Ai z '■= M z l0 \ Furthermore, 
for all states ui and directions z, we set 

L z (uS) := M z {uj) - M- z (uj). 

In particular, we obtain L y (ui'(y)) = L y {ui(x)) > 0. As 
we prove in Lemma 13 in Appendix [Aj if y is chosen 



in a clever way, then this is the maximal possible value: 
there is a particular choice of y such that the map z t— > 
L z (u)'(y)) attains its unique global maximum at z = y. 

This property allows us to construct a new protocol for 
Alice and Bob to transmit direction information. Fix this 
particular choice of y, and ui'{y). For all other directions 
z ^ y, define ui'(z) by rotating ui'(y) appropriately, i.e. 
uj'(z) := Gsv'(y), where S € SO(d) is any rotation with 
Sy — z. The new protocol works as follows: 

• Alice encodes some direction x into the state u>'(x) 
and sends many copies of it to Bob. 



• By measuring the received copies, Bob determines 
a good estimate of the function f(z) := M z (uj' (x)). 
Bob's guess is the vector z for which L z (oj'(x)) = 
f(z) — f(—z) is maximal. 

Remarkably, from an arbitrary protocol to transmit 
direction information, we have constructed a standard 
protocol. This involves a difference L z (oj) = A4 z (u>) — 
A4_ z (uj), which has striking similarity to the spin-1/2 
angular momentum expectation value in quantum me- 
chanics |75j : this expression is the expectation value of a 
random variable which assigns to direction z, and 

"—1" to direction (— z). 

Since the new protocol allows Alice to transmit arbi- 
trary spatial directions to Bob, it must satisfy Postu- 
late 2. Thus, if we have two states u and <p that sat- 
isfy Lyito) > L z (uj) and L y (tp) > L z (ip) for all z ^ y, 
they must either be equal, or one must be strictly nois- 
ier than the other, as in eq. ([I]) (exchanging the names 
u) •<-> (p if it is the other way round). As we prove in 
Lemma [7] in Appendix [5J this equation implies for the 
states themselves that <p — ^ . XjG R -iuj, as states turn 
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out to be uniquely determined by their directional infor- 
mation. Since both lo and tp can be used as codewords for 
direction y in our standard protocol, our intermediate re- 
sult one page above implies that Grlu — lo and G R ip = tp 
for all R e SO(d) with Ry = y. 

Suppose that furthermore the maxima agree, i.e. 
L y (w) = L y (ip) =: M holds. Then 

M = Ly(<p) = ^ ^jLy(G R -lU) = ^ Xj L R]V uj . 

i i v " v ' 

<M 

This is only possible if L Rjy u> = M for all j, and thus 
Rjy = y by construction. But then G Rj w — uj for all j 
as just mentioned, and applying G R -i to both sides and 
substituting into the relation between ip and u) proves 
that tp — uj. Thus, if two states encode the same direction 
in our standard protocol, with the same maximal value of 
L, they must agree. This property will now be used to 
determine the state space of a direction bit. 

From now on, x and y will denote arbitrary directions, 
disregarding the special choices above. Call any state ui 
with the property that L x (uS) > L y (oj) for all y ^ x a 
codeword for x. The codewords uj'(x) constructed above 
are in general not the "optimal" ones for the standard 
protocol - we might be able to find "better" ones, ui"(x), 
with L x (lo"(x)) > L x (lo'(x)). The previous inequality 
can be interpreted as saying that the uj"(x) let Bob de- 
termine x more quickly than the codewords oj'(x) in the 
standard protocol above, because the difference in proba- 
bilities is larger and statistically visible after transmission 
of fewer direction bits. 

As we show in Lemma [15] in Appendix |XJ there is an 
"optimal" set of codewords which we call tu x , with the 
property that L x (uj x ) > L x {ui'{x)) for all other code- 
words u)'[x). The codewords for different directions are 
related by rotations: if y — Rx for R £ SO(d), then 
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= GrP->x- Furthermore, there is a constant < a < 1 
such that Ly(uj y ) — a for all y\ we call a the direction 
bit's visibility parameter. 

Given oo x , we can define a "uniform noise" state which 
we call the maximally mixed state \i: 



G R oo x dR - 



R<£SO(d) 



ojrx dR. 



R.eso(d) 



Since all oo y are related by rotations, /i is independent 
of the initial choice of x. As this is an integral over the 
invariant Haar measure, there is constant c <G (0, 1) such 
that M. y (n) = c for all y. We call c the direction bit's 
noise parameter. 

Now suppose oo is any state which is a codeword for 
some direction x. Then A :— L x {u)/a is in the in- 
terval (0,1]. Thus, oo' :— \oo x + (1 — A)/i is a valid 
state, and it is easy to see that it is also a codeword 
for x. But L x (oo') — L x (co), and so the intermediate re- 
sult above implies that u) = a/. Since every state can 
be approximated arbitrarily well by some codeword, we 
have proven that every state oo can be written in the form 
oo = \u x + (l — \)fi for some direction x, where < A < 1. 

We are free to reparametrize the state space Qd via 
some affine map </> : R D —> R D , where D is the di- 
mension of Qd- replacing states via oo <— > Co := <^(w), 



effects via M H> M := M < 



and transformations via 



G 



G :-- 



G > 



does not change any probabili- 



ties or physical predictions. Basic group representation 
theory [44] tells us that we can choose <f> such that the 
transformed group Q acts linearly and contains only or- 
thogonal matrices, and the transformed states Co x (for 
different x) - being connected by reversible transforma- 
tions - have all the same Euclidean norm 1. Moreover, 
the maximally mixed state /t, being invariant with re- 
spect to all transformations, becomes the zero vector. 

Since all states Co are convex mixtures of some Co x and 
/t, we obtain the situation depicted in Fig. [5j the trans- 
formed state space fid is compact convex subset of the 
D-dimensional unit ball, with all oo x on the surface and 
pb = in the center. 

It is easy to see that the maximally mixed state fi is in 
the interior of fid, since it is a mixture of all pure states. 
Hence there is some ball of radius e > around jl = 
which is fully contained in fid- Thus, if v £ M. D is any unit 
vector, then ev/2 must be a valid state in Qd- As we have 
proven above, there is some < A < 1 and some direction 
x £ M. d such that ev/2 — \Co x + (1 — This is only 
possible if e = 2A and v = Co x - in other words, v £ Cld- 
This proves that fid is the full D-dimensional unit ball. 
By construction, the map x t— > Co x is a homeomorphism 
from the unit sphere in M. d to the unit sphere in M. D . This 
proves that D = d. 



Theorem 1. The state space of a direction bit is 
a d-dimcnsional unit ball. 




FIG. 5: After a reparametrization, we obtain that the direc- 
tion bit state space Qd is a compact convex subset of a unit 
ball. Since the maximally mixed state fi is in the interior, 
there is some e > such that the state space contains a full 
e-ball around the origin fi = 0. But we have proven that all 
states Co are convex combinations of /} and some state lo x with 
\lo x \ — 1, thus uo x must thus lie on the line starting at p, = 
and crossing Co. Consequently, all points on the sphere must 
be contained in the state space - we obtain the full unit ball. 
By dimension counting, it is d-dimensional. 



This shows that a direction bit cannot be described 
by a classical probability distribution: it must carry a 
non-classical state space, exhibiting uncertainty relations 
among d independent, mutually complementary mea- 
surements. Probabilistic systems of this type, i.e. ball 
state spaces, have been studied before [^5H17] . In quan- 
tum physics as we know it, there is only one kind of 
system with a ball state space: it is the qubit, a quan- 
tum 2-level state space. It is three-dimensional, which 
coincides with the spatial dimension, confirming the re- 
sult we just proved. By classifying the affine maps from 
the ball to [0, 1], it is easy to check that we must have 



M x {lo) = c + {a/2)(Co x ,Co). 



(3) 



In the familiar three-dimensional case, if c = 1/2 and 
a = 1, this describes a quantum spin measurement in 
direction x; if c ^ 1/2 or a < 1, it is a noisy spin mea- 
surement. 

To see why ball state spaces satisfy Postulate 2, note 
first that two states ip, oo, with corresponding "Bloch vec- 
tors" <p, Co in the d-dimensional Euclidean unit ball, are 
equally noisy in their directional information if and only 
if \ip\ = \Co\; similarly, ip is noisier than oo if and only if 
1 < \Co\ (in the case d = 3, where the state space is a 
qubit, this condition becomes tr(</> 2 ) < tr(w 2 )). For any 
protocol, and any spherical shell of fixed Bloch vector 
norm, there is a one-to-one correspondence of states in 
that shell and spatial directions that these states encode. 
Thus, if two different states encode the same direction, 
they must have different norms, and so one is noisier 
than the other. We say more about the different possible 



protocols in Lemma 21 in the appendix 



IV. FRAME BITS INSTEAD OF DIRECTION 
BITS? 

Before we formulate Postulates 3 and 4 and prove more 
properties of direction bits, let us reconsider one basic 
assumption. As depicted in Fig. |4j we have assumed 
that the orientation of a measurement device in space 
is given by a direction vector, implicitly assuming some 
internal rotational symmetry of the device. What if we 
drop this assumption? In general, the orientation of a 
massive body in M. d is given by an orthonormal frame, 
that is, by some oriented orthonormal basis that can be 
written in the form of an orthogonal matrix X g SO(d), 
instead of a unit vector x £ M. d . Thus, an interesting 
question is what happens if we repeat the calculations 
above, formulating analogues of the postulates for "frame 
bits" instead of direction bits. 

While we have to leave the general answer open, we 
can give the answer in a particular special case. First, 
note that for direction bits as considered above, our cal- 
culations show that Alice and Bob can also apply the 
following protocol: 

• Alice encodes spatial directions x £ W l into the 
particular states tv x . 

• Bob holds a two-outcome measurement device, 
where the first outcome is described by the effect 
My, with y the direction in which the device is 
pointing. Upon receiving (many copies of) some 
state to, Bob looks for the spatial direction y in 
which M. v {ui) is maximal, which will be his guess 
of Alice's direction x. 

Effectively, the device that Bob holds asks the yes-no- 
question "Is it this direction that Alice encoded?" The 
actual direction is the one in which the probability to 
obtain "yes" is maximal. 

Let us now ask whether we can implement the anal- 
ogous protocol for the case that Alice wants to send 
a frame X £ SO(d). The main idea is that X = 



Xd is used to denote the spatial orienta- 



tion of an orthonormal frame attached to an object, 
with the i-th orthonormal vector pointing in direction 
Xi\ rotations R £ SO(d) will thus map this to RX — 



Rxi 



Rxd ■ The protocol is depicted in Fig. 



We formulate analogues of Postulates 1 and 2 (encoding 
and minimality) for this setup, and consider them only 
in the special case of this protocol. 

As we show in Appendix [Bj a calculation very similar 
to the one above proves that the "frame bit" state space 
must be generated by pure states toy, labelled now by 
orthogonal matrices Y £ SO(d) that are connected by 
rotations. In complete analogy to above, every state uj 
can be written in the form uj = Xujy + (1 — A)/i, where 



< A < 1, /i is a maximally mixed state, and Y £ SO(d) 
some frame. Thus, the frame bit state space must also 
be a Euclidean ball of some dimension D. 

At this point, we run into a topological problem: simi- 
larly as for direction bits, the map X i-> ljx turns out to 
be a homcomorphism, this time from SO{d) to the unit 
sphere on R D . Since SO(d) is not simply connected for 
d > 2, but the unit sphere in R D is simply connected 
for D > 3, this is only possible if D = 2 and thus (from 
dimension counting) d — 2. Thus, we have proven that 
there is no convex state space that allows implementation 
of this protocol, while satisfying analogues of Postulates 

1 and 2 - unless d = 2, where a frame is the same as a 
spatial direction, and the setup reduces to the concept of 
the direction bit. (We will rule out d = 2 in Section VI 
using two further postulates.) 



V. SPATIAL GEOMETRY FROM 
PROBABILITY MEASUREMENTS 

Before continuing our derivation, we take another 
slight detour by asking for the relationship between the 
geometry of physical space and state space. 

As indicated in Fig. [4j our setting assumes that macro- 
scopic objects can be physically rotated. The implicit as- 
sumption behind this is that local physical space in the 
laboratory is a vector space with a Euclidean structure, 
that is, with an inner product, that determines the no- 
tion of a rotation as a linear orthogonal map and, at the 
same time, allows to compute angles between vectors. 

We assume that physical rotations R £ SO(d) have 
representations Gr £ SO(d) C Q A on the direction bit 
state space A. As we show in Lemma 23 in Appendix [A"} 
group theory dictates that the map R t-t Gr is linear 
and, moreover, that it is of the form Gr = ORO^ 1 for 
some orthogonal matrix O. Thus, there is automatically 
a correspondence between the vector space and Euclidean 
structures of state space and physical space. This has 
an interesting consequence: it implies that observers can 
measure physical angles by measuring probabilities. In 
other words: even if an observer has no meter stick to 
measure physical angles, she may infer physical angles 
from probabilities. 

In Appendix [C] we give a boot-strapped protocol that 
allows observers to determine angles from probability 
measurements on direction bits. This method general- 
izes the simple quantum-mechanical insight that polar- 
ized electrons with spin-up in a fixed direction give prob- 
ability of spin-up in another direction (of relative angle 
9) with probability cos 2 (0/2). 

This structural coincidence (which is in particular true 
for quantum theory) seems remarkable beyond the spe- 
cific derivation in this paper. Clearly, in this work, we 
start with postulates that assume a Euclidean structure 
in physical space, and obtain the ball state space with its 
reversible rotation transformations as a consequence. It 
is then not very surprising, though mathematically not 
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X e SO(d) 



Y e SO(d) 



"Is it this 
frame?" 



u(X) 



frame bit 




yes 




FIG. 6: The "frame bit" setup for the special case discussed in Section [IV] Instead of a spatial direction, Alice's goal is now to 
send an orthonormal frame X £ SO(d) to Bob, by transmitting some state u)(X) of some arbitrary convex state space. Bob 
holds a macroscopic binary measurement device that can be rotated arbitrarily in space. In contrast to the situation in Fig. [I] 
for direction bits, Bob's measurement device does not possess any rotational symmetry, such that its working is specified by 
the orthonormal basis that defines its spatial orientation, Y € SO(d). Alice and Bob have agreed on the protocol that Bob 
detects the frame Y £ SO(d) in which the "yes" -probability is maximal. This will be his guess for Alice's frame X £ SO(d). 
In contrast to the "direction bit" situation, we prove that there is no convex state space that allows this protocol while at the 
same time satisfying the analogue of Postulate 2 - except for spatial dimension d = 2, where frames and directions coincide. 



trivial, that observers can use this state space structure 
to obtain information on spatial angles. 

However, irrespective of the specific construction in 
this paper, it is interesting to speculate whether the phys- 
ically fundamental order of logic (if there is any) might 
actually be reversed. In Example [39] in Appendix [C] we 
give a modification of the direction bit setup where this 
speculation can be shown to make sense. 

In this example, space is described by a topological 
manifold, and Bob's local laboratory space does not have 
a vector space structure or inner product to begin with. 
We then assume that there are physical processes that 
can in a certain sense be interpreted as generalized ro- 
tations of a device, yielding reversible transformations 
on some convex state space. Under specific conditions, 
we show that Bob can use the coefficients of the mea- 
surement outcome probabilities in the space of effects to 
define natural coordinates on his local physical space. 

In these new coordinates, the generalized rotations act 
linearly and orthogonally on the devices, establishing 
spatial Euclidean structure that was not assumed to be 
there in the first place. Even though our particular ex- 
ample is not meant to describe an actual physical mech- 
anism, it is tempting to speculate whether the Euclidean 
structure of tangent space might be fundamentally inher- 
ited from the convexity of probabilities. 



VI. PAIRS OF SYSTEMS: POSTULATES 3 & 4 

Consider two (distinguishable) direction bits A and B; 
taken together, they form a joint system AB, described 
by some convex state space O^g. In the usual formula- 
tion of quantum theory, the joint state space SIab would 
be given by the density matrices on the tensor product 
Hilbert space - however, in this paper, we do not assume 
quantum theory. 

In full generality, for two state spaces VLa and SIb, the 
framework of convex state spaces allows infinitely many 
possible ways to combine them into some SIab , restricted 



only by a few physically obvious constraints. One of 
them says that if uj a G SI a an d uj b € SIb are two local 
states, then there is a joint state uj a u b € SIab which de- 
scribes the independent local preparation of both states 
on the subsystems, analogous to product states in quan- 
tum theory. Similarly, ii M A and M. B are measurement 
outcomes (i.e. effects) on A and B, then by assumption 
there is a global effect A4 A A4 B which asks whether both 
measurement outcomes have occurred jointly. It satisfies 
in particular 

(M a M b ){uj a uo b ) = M a {uj a ) ■ M B {u B ), 

and can be shown to respect the no-signalling condi- 
tions [20]. Furthermore, we assume that the local state 
space A (B) is closed with respect to postselection ac- 
cording to measurement outcomes on B (A), for details 
see Appendix [A] 

In physics, we are often interested in expectation val- 
ues of observables such as energy or angular momentum. 
Classical as well as quantum physics have an important 
structural property regarding composite systems: sup- 
pose we have two systems A and B of the same type, 
and h is a single-system observable (in quantum theory, 
where states uj a are density matrices, this would be a 
map h(w A ) = tr(o- ' H), where H = H*). Suppose we 
are interested in the sum of this observable on systems 
A and B - this defines a new observable h^> on pairs of 
systems, where 

h i2 \u A uj B ) = h(uj A ) + h(uj B ) (4) 

for uncorrelated states. What if we want to evalu- 
ate h^ 2 '(u> AB ) for correlated (possibly entangled) states 
ui AB ? In quantum theory, there is a unique way to do 
this, because eq. Q uniquely determines and its ac- 
tion on all states. We necessarily have 

h^(uj AB ) = tr [uj ab (IT® 1 + 1® H)] . 

Arguably, this uniqueness is an important property of 
composite systems - if it failed, it would not be clear how 
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to add up observables on composite systems (for example, 
there would be no unique notion of a "non-interacting 
sum of Hamiltonians" , and thus no unique physical way 
to combine systems in trivial non-interacting ways). We 
promote this property to a postulate. 



Some of this interest is due to the fact that many of 
these state spaces contain states with non-local corre- 
lations that are stronger than those allowed by quan- 
tum theory. For example, if VLa = &b is the square 
state space as in Fig. [3^), then the composite f2™ff is 
the no-signalling polytope for two binary measurements 
on two parties, containing PR box states which violate 
the Bell-CHSH inequality stronger than any quantum 
state [501 I5TH53"] . This example also illustrates that the 
convex state spaces formalism describes a vast landscape 
of theories with physical properties that can be very dif- 
ferent from those of quantum theory. 

In the case of two direction bits A and B, where the 
local state spaces are d-balls, there are also many possible 
choices of the global state space fi ab in accordance with 
Postulate 3, including fi™^ and ^™ff . Our fourth and 
final postulate now states that this global state space 
allows for continuous reversible interaction. 

Postulate 4 (Interaction). For two direction bits 
A and B, there is a continuous one-parameter group of 
transformations {T AB } t ^R which is not a product of local 
transformations, T AB ^ T t A T t B . 



The group T A describes continuous reversible time 
evolution in a closed system of two direction bits: if we 
start at time t = with a product state uj a oj b , then 
the state at time t will be ui AB (t) := T ab {uj a u b ). If 
T t AB was a product transformation T A T B for all times 
t, then the global state would remain a product state 
forever: cu AB (t) = (T a uj a )(T b u} B ). In this case, the two 
direction bits could never become correlated; there would 
be no interaction. Postulate 4 excludes this: it states that 
there is at least one time t £ R such that T AB is not of 
this product form. 

The global transformations T AB and the local trans- 
formations G A G B with G A ,G B € SO(d) generate a Lie 
subgroup of Gab] we call it Wab- Due to ([5|, it is a 
matrix Lie group acting on The corre- 

sponding Lie algebra is called t)AB- Let X be some ele- 
ment of i)AB, and consider the circuit in Fig. [7} It depicts 
the outcome probability of a product measurement on an 
evolved product state, 

f(t) :=M A M B (e tx uj A u, B )e [0,1]. 

As we show in Lemma [24] in Appendix |Aj we may as- 
sume without loss of generality that the direction bit 
state space has noise parameter c = 1/2 and visibility 
parameter a = 1. This is the "noiseless case", where 
spin measurements give probabilities A4 x (uj- X ) = and 
Aii-fwj.) = 1, implying in particular that /(0) = 1 for 
the circuit in Fig. [7j Since this is the maximal possible 
value, we must have /'(0) = and also /"(0) < 0. Thus 

M A M B Xu A uo B = 
M A M B X 2 u A u B < 



Postulate 3 (Sums of observables). If h is any 

direction bit observable, then there is a unique observable 
on pairs of direction bits such that 

h<- 2 \u A u B ) = h(u A ) + h(ui B ). 

It is easy to see that Postulate 3 holds true if and 
only if the uncorrelated states uj a uj b linearly span the 
global state space SIab- Thus, Postulate 3 is equivalent 
to a condition that is usually called "local tomography" 
in the literature [8/. it says that joint states on AB are 
uniquely characterized by the local measurement outcome 
probabilities on A and B and their correlations. Denoting 
the dimension of CI a by dA, this is also equivalent to 

d A B + l = {d A + l){d B + l). (5) 

The global state space CIab carries its own group of 
reversible transformations Gab- We assume that Al- 
ice and Bob may still apply their local reversible trans- 
formations, that is, if G A € Ga and G B € Gb, then 
qAqB g G ab ■ Due to Postulate 3, this transformation is 
uniquely determined by its action on uncorrelated states: 
(G a G b )(cj a uj b ) = {G a uj a ){G b uj b ). 

This shows that Postulate 3 also has geometric signifi- 
cance: suppose we decide to carry out a local coordinate 
transformation; in our case, this is a rotation R £ SO(d). 
This transformation acts on states of direction bits via 
uj a H> Gru a . The third postulate now tells us that this 
uniquely determines the coordinate transformation map 
on (correlated) pairs of systems: they are transformed 
via u> AB i — ^ (G rG r)lu ab , which is the only possible lin- 
ear map that transforms oj a ui b into (GrU) A ){Groj b ). 

Every pair of state spaces VLa and CIb can be combined 
into a joint state space n^g in accordance with Pos- 
tulate 3: the "smallest" possible choice (denoted ^ab) 
is to define it as the convex hull of all product states 
lv a uj b . On the other hand, the "largest" possible choice 
(denoted Sl™ff ) is to allow all vectors oj ab such that all 
local measurements yield valid probabilities, even after 
postselection [48]-[50j. Every compact convex set CIab 
which satisfies 

r-)min r— r> C~ O max 
"AS — S MB ^ "Afl 

is then a possible choice of the global state space, as 
long as local reversible transformations map CIab into 
itself. In quantum theory, CIab turns out to be the set 
of unentangled states, while the actual global quantum 
state space VLab lies strictly in between Qa^b an d ^Aff ■ 
Composites of convex state spaces have been exten- 
sively studied in the quantum information literature. 
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for all x,y € K d with \x\ = \y\ = 1. By considering other 
circuits of this kind, we obtain a long list of constraint 
equalities and inequalities which must be satisfied by all 
global Lie algebra elements X. 



.UJ 



A\ 




UJ 



B\ 




,tx 




M 



\M 




FIG. 7: Circuit model which yields constraints for the global 
Lie algebra elements X. We prepare a pure product state 
uj^ojy , apply the transformation exp(tX), and perform a 
product measurement M A My . Since this gives probability 1 
for t = 0, and probabilities cannot be larger than 1 for other 
(small) t, this implies that the derivative at t = must vanish, 
and the second derivative must be non-positive. 



Surprisingly, as shown in Appendix [A] and in |54| , if 
d =/= 3, then the only matrices X which satisfy all con- 
straints are those of the form X = X A + X B . And 
these elements generate non-interacting time evolution 
of product form: exp(tA) = exp(tX A ) exp(tX B ). Thus, 
if d 7^ 3, T-Lab contains only product transformations, 
and Postulate 4 cannot be satisfied. 



Theorem 2. From Postulates 1-4 it follows that 
the spatial dimension must be d = 3. 



The main reason why d = 3 is special becomes visible 
by inspection of the proof in [53]. It boils down to the 
group-theoretic fact that (at least for d > 3) the subgroup 
of SO(d) which fixes a given vector (that is, SO(d — 1)) 
is Abelian only if d = 3. In other words, the fact that 
rotations commute in two dimensions, but not in higher 
dimensions is the main reason why d = 3 survives. The 
cases d = 1 and d = 2 are special as well, but are ruled 
out in the proof for other reasons. 

It remains to show that we actually get quantum the- 
ory for two direction bits if d = 3. We already know that 
the dimension of the global state space is dimil ab — 
(d + l)(d+ 1) — 1 = 15, which agrees with the number of 
real parameters in a complex 4x4 density matrix. Thus, 
we can embed Qab in the real space of Hermitian 4x4- 
matrices of unit trace. Now we have global Lie algebra 
elements X £ \)ab that are not just sums of local genera- 
tors, i.e. X 7^ X A +X B . However, as shown in [55], these 
elements are still highly restricted: they generate unitary 
conjugations, i.e. transformations of the form p i— > U pU' . 

By Postulate 4, at least one of these unitaries must be 
entangling. Moreover, all local unitary transformations 
are possible (in the ball representation, these are the ro- 
tations in SO(3)). It is a well-known fact from quantum 
computation [56] that a set of unitaries of this kind gen- 
erates the set of all unitaries - that is, every map of the 



form p i— > UpW must be contained in the global trans- 
formation group Hab Q Gab- 

The orbit of this group on pure product states gen- 
erates all pure quantum states, and one can show |55j 
that there cannot be any additional non-quantum states. 
Thus, we have recovered the state space of quantum the- 
ory on two qubits. Due to positivity, all effects must be 
quantum effects; in the noisy case (i.e. c ^ 1/2 or a < 1), 
not all quantum effects may actually be implementable - 
that is, we might have a restricted set of measurements. 
We have thus proven: 



Theorem 3. From Postulates 1-4, it follows that 
the state space of two direction bits is two-qubit 
quantum state space (i.e. the set of 4 x 4 density 
matrices), and time evolution is given by a one- 
parameter group of unitaries, p i-> U(t)pU(ty . 



As a simple consequence, there exists some 4x4 Hermi- 
tian matrix H such that U(t) — exp(-iHt), i.e. a Hamil- 
tonian which generates time evolution according to the 
Schrodinger equation. 



VII. CONCLUSIONS 

We have derived two facts about physics 
from information-theoretic postulates: the three- 
dimensionality of space [57], and the fact that proba- 
bilities of measurement outcomes for some systems are 
described by quantum theory. In order to do this, we 
assumed that there exist "reasonable" physical systems 
which, in a certain sense, carry minimal amounts of 
directional information. 

Our result supports and clarifies the point of view that 
the geometric structure of spacetime and the probabilis- 
tic structure of quantum theory are closely intertwined, 
similar in spirit to [IHU [55H5U] . As one can see in Fig. [3j 
this conclusion becomes particularly obvious in the con- 
text of convex state spaces. This interrelation is not only 
axiomatic, but also operational: as we have shown in 
Sec.|VJ observers can measure - or even define - physical 
angles by measuring probabilities. 

Furthermore, these findings suggest exploring possible 
generalizations: the approach to construct state spaces 
from physical symmetry properties |70j . together with 
minimality assumptions, might reproduce quantum sys- 
tems of higher spin, or even physically interesting non- 
quantum state spaces that have so far remained unex- 
plored. 

In summary, there seem to be two possible interpreta- 
tions of the results in this paper. First, the results might 
simply be mathematical coincidence, without any deep 
physical reason underlying them. This is perfectly con- 
ceivable; in this case, the main contribution of this paper 
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is a detailed analysis of the structural fit between quan- 
tum theory and spacetime. Second, the results might 
point to an actual logical relation between geometry and 
probability that arises from some unknown fundamental 
physics, such as quantum gravity. 

If the second possibility turned out to be true, this 
would suggest an exciting speculation, stated also in [611 
162] : In many approaches to quantum gravity, the smooth- 
ness and/or three-dimensionality of space is considered 
to be only an approximation. But then, given the close 
relation between smooth Euclidean space and the qubit, 
maybe the universe's probabilistic theory is only approx- 
imately quantum? Taking this idea seriously would sug- 
gest to go beyond the usual "quantization of geometry" 
paradigm. 
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Appendix A: Characterization of all direction bit state spaces 

The proof consists of four steps: first, we prove that the direction bit state space is a Euclidean ball (possibly noisy, 
that is, with a restricted set of measurements). Then we show that that the noisy case can always be reduced to the 
noiseless case. Given this, the results from Ref. [54] do most of the work: they show that only d — 3 is possible. As a 
last step, in order to obtain quantum theory for d = 3, we refer to the results in Ref. [55] . 

We start with a formal definition of state spaces. As we have motivated in the main text, the set of normalized 
states on any system is a compact convex set. To simplify the calculations, it makes sense to start right away with 
the full set of unnormalized states, which will be all vectors of the form A • u>, where w is a normalized state, and 
A > 0. This yields a cone in the sense of convex geometry - that is, a subset C of a vector space with the property 
that x G C implies Ax G C for all A > 0. 

For reasons of brevity, we will not give a detailed explanation and motivation of all definitions. For more discussion, 
we refer the reader to the references mentioned in the main text, in particular to Chapter 3 in [33] . 

Definition 4 (State space). A state space is a tuple (A,A + ,U a ,Ea), where 

• A is a real finite- dimensional vector space, 

• A + C A is a proper cone (that is, a closed, convex cone of full dimension with A + n (— A + ) = {0}), called the 
cone of unnormalized states, 

• IA A is a linear functional which is strictly positive on A + \ {0} ; called the order unit of A, 

• Ea is a closed convex set of functionals which are non-negative on all of A + , containing U A , having full dimen- 
sion dim(£^) = dim(A), and satisfying Ai(x) < U A {x) for all x G A + and M. G Ea- It is called the set of 
allowed effects. 

Furthermore, we define the dual cone A* + as the set of all linear functionals which are non-negative on A + , which 
implies Ea C A*,. The set of all ui G A + with 14 a (oj) — 1 will be denoted Qa o,nd is called the set of normalized 
states. 

The requirement dim^E^t) = dim(A) has a simple physical motivation: if dim(EA) < dim(A), then we would have 
states uj =/= if that would yield the same outcome probabilities for all possible measurements, invalidating to call them 
"different states" in the first place. 

To save some ink, we will usually just write A for the state space, instead of writing the full tuple. However, keep 
in mind that the choice of a state space comes also with a choice of A + , U A and Ea- 

Given any measurement with an arbitrary number of outcomes, the probability of one of the outcomes - if measured 
on some state w G Qa ~ will be a real number in [0, 1] . The map Ai which takes the state u to the corresponding 
probability M.(w) must be linear, since statistical mixtures of states must yield mixtures of probabilities. In principle, 
every linear functional M. G -E™ ax may describe a measurement outcome probability, where 

£^ ax :={MeA* + I 0<M{lu) < 1 for all u G n A }- (Al) 

However, one may imagine that it might be physically impossible to implement measurement devices for all these 
linear functionals. This is why the set Ea is introduced in the definition above: it is meant to describe the collection 
of all possible effects that may actually be implemented in measurements. Clearly, we have Ea C E^ ax . In some 
publications (e.g. [23]), it is assumed that Ea — -E™ ax , but not in this paper. In other words, wc are not assuming the 
"no-restriction hypothesis" here [23] . The possibility to have Ea ^ -E™ ax describes situations, as we will see below, 
where all measurements on a direction bit are by necessity intrinsically noisy. 
As an example, in finite-dimensional n-level quantum theory, 

• A is the real vector space of Hermitian matrices on C™, 

• A + is the set of positive semi-definite matrices on C n , 

• U A (p) — tr(p) is the trace functional, 

• Ea is the set of all maps of the form p >-» tr(pM), with M > a positive semi-definite matrix, 

• Ha is the set of density matrices on C n . 

Similarly, the state space of classical n-level probability theory is (B, B +1 U B ', B + ), where 
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• B = R n , 

• B + = {P= (Pi, ■ ■ ■ ,Pn) I all pi > 0}, 

• U B {p) =px +p 2 + ... +p n , 

• Eb is the set of all maps p^t p - q with q = (g%, . . . , q n ) such that all qi > 0, where • denotes the Euclidean inner 
product, 



• Qb is the set of all probability distributions: 

Mb = \ P= (pi,...,p n ) 



all p t > 0,y^ y Pi = 1 



In both classical probability theory and quantum theory, all effects are allowed. 

We would like to talk about reversible transformations on state spaces. To this end, we define 

Definition 5 (Dynamical state space). A tuple (A, A + ,U A , Ea,Qa), where (A, A + ,U A , Ea) is a state space, and Qa 
is a compact (possibly finite) group of linear maps on A, is called a dynamical state space, if every G £ Qa satisfies 

• GQa — (or, equivalently, U A o G — U A and GA + = A + ), and 

• E A oG = E A - 

These two conditions say that reversible transformations must respect the set of normalized states and the set of 
allowed effects. It is easy to sec that the first condition implies that A* + o G = A* + for all G £ Qa- 

In quantum theory, Qa is the group of all maps of the form p i— > U pU\ with U unitary. In classical probability 
theory, Qb is a representation of the permutation group. Specifically, for every permutation tt, there is a reversible 
transformation G^ with G^ipi, . . . ,p n ) = (p^i), . . . ,Pn(n))- 

Here is a rigorous definition of equivalence of state spaces: 

Definition 6 (Equivalent state spaces). Two state spaces A and B are equivalent if there exists a bijective linear 
map L : A — > B such that the following conditions are satisfied: 

• LA+ = B + , 

• U B oL=U A , 

• Eb o L = Ea ■ 

Two dynamical state spaces A and B are equivalent, if they are equivalent as state spaces and additionally satisfy 
Qb = LoQ A o It 1 . 

This is clearly an equivalence relation. If two (dynamical) state spaces are equivalent, they are indistinguishable in 
all their physical properties. 

Now we show how the notion of noisiness in Postulate 2 can be seen as a special case of "group majorization" , 
a natural definition of noisiness with respect to a group that encompasses the classical and quantum cases in the 
obvious way. This definition is well-known in the mathematics literature |25j ; we rephrase it in Definition [8] below in 
the context of convex state spaces. We start by showing a simple consequence of Postulate 2. 

Lemma 7. Suppose that uj and ip are both possible encodings of the same direction x £ M. d , \x\ = 1. in some protocol 
that satisfies Postulate 1. From Postulate 2, it follows that there exist < Xj < 1, Xj = 1, and rotations 
Rj G SO(d) (resp. 0{d) ifd=l) such that 

J 



or with tp and uj interchanged. If <p ^ u then this is a proper convex combination. 
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Proof. According to Postulate 2, the assumptions of this lemma imply 

M®(<p) = X i M %»( u ) for a11 directions z G R d , \z\ = 1, 

3 

or vice versa (in the latter case, rename <p and uj to fit this formula). Set oj' :— J2j ^oGrt iU; , then M.z (w') = .Mi (v) 

for all z. But then oj' could be used as a replacement for p in the protocol, namely, as yet another codeword for 
direction x. Moreover, oj' and ip are by construction equally noisy in their directional information, so Postulate 2 
implies that they must be equal. □ 
Now we show how this fits into a majorization framework. 

Definition 8 (Group majorization). Let A be any dynamical state space, and T-L a compact subgroup of Qa- Then 
we define a relation -<u on Qa in the following way: for uj,p G Qa, it holds uj -<u ip if and only if there are A; > 0, 
i = 1, . . . , n, X)i=i = 1> an d Ti £% such that 



OJ 



^%Ti<p. (A2) 



We write uj ^ p if any only if oj ~<q a (p. 



Lemma 9. The noisiness relation is a partial order on the orbits. That is, for any dynamical state space A and any 
compact subgroup % C Qa, we have 

(i) if oj -<fi ip and p -<u p, then oj -<h p. 

(ii) uj -<-}i oj for all oj G Qa, o,nd 

(Hi) if to p and ip -<-h oj, then there exists T G TL such that uj = Tp. 
Moreover, we have 
(iv) if oj -<-}i p then T\oj -<u Tip for all T\,Ti G %. 



Proof. Property (ii) is trivial, by setting Ai = 1 and 7\ = 1 in (A2|. If oj <h tp, then 

T x oj = Y KiTiT.T^Tip T 2 p 

i 

if Ti,T 2 G H, proving (iv). If additionally p <n P such that ip = £V X j T jP> then w = E« X * X j T i T jP P- This 
proves (i). It remains to prove (iii). To this end, introduce an inner product (•, •) on A which is invariant with respect 
to Qa (and thus H), i.e. 

(x,y) = (Tx,Ty) for all x, y G A, T G Q A . 



Moreover, let |] • || be the corresponding norm. Then (A2) and the triangle inequality yield 

i i i 

Thus, if both u p and <p> -<u oj, then ||a;|| = \\p\\ — \\Tip\\ —: r. Let S r be the unit sphere of radius r, then ( |A2[ ) 
says that uj G S r is a convex combination of the Tip G S r . Geometrically, it is clear that this is only possible if 
Tip = oj whenever Ai 7^ (formally, it follows from the fact that all boundary points of the ball are exposed points). 
Setting T :— Ti for any of these i proves (iii). □ 
Now we see how our definition of noisiness from Postulate 2 fits naturally into the well-known notion of majorization. 
In the case of quantum theory (with the full unitary group), it follows from [371 Thm. 12.13] that our relation ^ is 
identical to Nielsen's majorization relation on density matrices. From Lemma [7| we obtain the following: 

Theorem 10 (Noisiness and group majorization). A state p is at least as noisy in its directional information as 
another state uj if and only if 

<P ^SO(d) w, 

where SO(d) denotes the representation of the rotation group within the group of reversible transformations of a 
direction bit. (If d — 1, then SO(d) has to be replaced by 0(1) .) 
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Given two state spaces A and B, we would like to define a composite state space AB which, according to Postulate 
3, satisfies the local tomography property |10j : states on AB are uniquely characterized by the statistics of local 
measurements. Eq. |5]) in the main text translates into dim(AB) — dim^4dimi3; thus, we may choose the vector 
space AB to be the tensor product A ® B. This will turn out to be a handy choice: we can represent independent 
preparations uj a lu b by products uj a ® u B ■ We get the following definition: 

Definition 11 (Locally tomographic composite). Given two dynamical state spaces A and B, a dynamical state space 
{AB, (AB) + ,IA AB , EabiQab) will be called a composite of A and B, if the following conditions are satisfied: 

• the linear space which carries the state space is AB = A® B, 

• U AB = U A ®U B , 

• if \p A £ A + and u> B £ B + , then ip A <g> uj b £ (AB) + , 

• if M A £ E A andM B £ E B , then M A ® M B £ E AB , 

• if Ga £ Ga and Gb £ Qb, then Ga ® Gb £ Gab, 

• for every M B £ Eb and u> ab £ (AB) + , the vector ui A ond ("conditional state") defined by 

kaA, A s M A ®N B (u AB ) , „ ... „ 

M A (u, A nd )= u a^ M b {uj abI forallM A £E A (A3) 
is a valid state, i.e. oj A ond £ A + , and similarly for A and B interchanged. 



Note that eq. (|A3j) is automatically satisfied if all effects on A are allowed. It means that we cannot get "new" 
states outside of Qa by preparing global states and postselecting on local measurement outcomes. Similarly, we might 
demand that any map of the form 

u A ^M AB {u) A ®w B ) 

for a fixed bipartite effect M AB and fixed state ui B is itself a valid effect on A. If this is violated, then the set of 
possible local measurements on A is increased by composing it with the other system B. However, since we do not 
need this condition in the following, we decided not to have it as part of the definition in order to have a result which 
is as general as possible. 

By setting Af B :— U B in eq. (A3), we obtain the conditional state which Alice sees if Bob does not perform any 
measurement. This is the reduced state oj a of lu ab £ Qab, satisfying 

M A {u A ) =M a ®U b (lo ab ) iov&W M a £E a . 



Thus, Definition 11 ensures that global states have valid reduced states (marginals). 
We continue by proving two claims in the main text in the following two lemmas: 

Lemma 12. With the notation of the main text (in particular, x = (1, 0, . . . , 0) T ), there is some outcome Iq and 
some direction y £ R d , \y\ = 1, such that M y ia) {uj{x)) > M { 1°J (ui(x)). 

Proof. As we have shown in the main text, the probabilities My\u}(x)) depend only on the first component y\ of y. 
Suppose that M v (u)(x)) = M < d y (uj(x)) for all i and all y. Let S £ SO(d) be the matrix S := diag(— 1, —1, 1, . . . , 1) 
(if d = 1 or d = 2 take S — —1), then it follows that M. y l \u>(x)) = M^g y {u{x)) for all i and y. Now consider the 
following two situations under which Alice attempts to send the spatial direction x — (1,0,..., 0) T to Bob: 

1. Bob's laboratory is aligned in exactly the same way as Alice's - that is, both share the same coordinate system 
(maybe by chance). In this case, Bob's coordinates xb of x are the same as Alice's: xb = (1,0, ... ,0) T . 

2. Compared to Alice's laboratory, Bob's lab is oriented differently, namely it is rotated by S relative to Alice. In 
this case, Bob's coordinates xb of x are xb — (—1,0,..., 0) T . 

Since Alice does not know which of the two situations (or any of the infinitely other possible ones) apply to Bob's 
laboratory, her encoding x h-> uj{x) must work in both cases. However, due to My \uj(x)) = Mg y (uj(x)) for all i and 
y, Bob sees exactly the same outcome probabilities in both cases, leading with probability one to the same estimate 
xb- This contradicts the soundness of the protocol, i.e. Postulate 1. □ 
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Lemma 13. Let io be any outcome that satisfies the statement of Lemma 12 Then there is some direction y G M. d , 
\y\ = 1, such that the state 

u'(y) := / G R u){x)dR (A4) 

J R£SO(d):Ry=y 

has the property that the map z H ► L z (u)'(y)) attains its unique global maximum at z = y. 
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Proof. If d = 1, then eq. (A4) becomes u/(y) = ui(x), and thus L y (u/(y)) = L y (uj(x)) > according to Lemma 
Since there are only two possible directions y = ±1, and since L- y = —L v , this must be the global maximum. 

Now consider the case d > 2. As in the main text, write M z := M ( z a) . Then L z (uj(x)) = M z (uj(x)) - M- z (uj(x)) 
depends only on the first component z\ of z; denote this value by £(z\). This defines a real continuous function 
£ : [—1, 1] — > [—1, 1]. By construction, it is an odd function: £(— z\) — —l(z\), and we know from Lemma [l2| that there 
is some z\ with £(z\) > 0. Since £ is continuous, it attains its global maximum £ max > somewhere. Since the set of 
points where this maximum is attained is compact, and since t(z\) = £ max implies l(—Z\) ^ £ m ax, the expression 

z\ := argmin{|zi| | £(zx) = i max } 

is well-defined. Let y G R d , \y\ = 1 be any direction with first component y\ — z*. By construction, £ max = 



L y (oj(x)) > L z {uj(x)) for all z ^ y. Define u>'(y) as in (A4|. Since L z o Gr = Lr-i z , we obtain 

L v(u'(y)) = L y (u(x)) = £ max . 
Consequently, L- y (u'(y)) = -l max - Let z £ {y,—y}, then 



L z {J{y))= L R - lz (uj(x))dR. (A5) 

J ReSO(d):Ry=y 

We have to show that this is strictly less than £ max . To this end, we define a continuous path on the surface of the 
d-dimensional unit ball. We will assume that z\ > 0; the case z\ < is treated analogously (z{ — is excluded from 
£(0) = -£(0) = < £ max ). For t G [~z*,Zi], let z(t) G R d be some vector with \z(t)\ = 1 such that t H> z(t) is 
continuous, z(— z*) = —y, z(z*) = y, and such that the first component of z{t) equals t. If z ^ {— y, y}, then there is 
some t G [—z*, z*) such that \z — y\ = \z(t) — y\. Hence there is some R G SO(d) with Ry = y such that R~ x z = zif), 
and since \t\ < |^||, we have 

■^H- 1 *^^)) = L z(t){u(x)) = £{t) < £ max . 



But this expression appears in ( A5 ) : the integrand is upper-bounded by £ max for all R, and is strictly less than £ max 
for the rotation R that we have just found. This proves that L z (uj'(y)) < £ m ax- d 

Now we are ready to give a thorough definition of a "direction bit" . It is arguably difficult to formalize Postulates 1 
and 2 from the main text into a rigorous mathematical definition: rigorously defining what is meant by a "protocol" 
seems hardly worth the effort (the result would be long and not very illuminating); similarly, a formalization of the 
physical intuition about spatial symmetries (rotating the device versus the direction bit etc.), as used in the initial 
stage of the proof, seems over the top for the purpose of this paper. Instead, we use two consequences of Postulates 
1 and 2, called Assumptions 1 and 2, as derived in the main at an intermediate stage of the proof, to write down 
a definition of direction bits. This avoids talking about the physical background situation, but ensures that all the 
"convex state space" argumentation rests on rigorous mathematical grounds. 

The meaning of the assumptions is as follows. Assumption 1 states that the standard protocol which we have 
constructed in the main text works: There is some state u> which may serve as a codeword for some direction x in the 
standard protocol. This is because the quantity L y (u>), i.e. the difference of probabilities in directions y and (—y), 
has unique maximum in y — x. Assumption 2 formalizes the consequence of applying Postulate 2 to the special case 
of the standard protocol, proven in the main text: if two states encode the same direction in the standard protocol, 
with the same maximal value of L, they must agree. Assumption 3 subsumes Postulates 3 and 4. 

Definition 14 (Direction bit). For d G N, a dynamical state space (A, A + ,U A , Ea, Qa) together with a distinguished 
continuous representation of S 0(d) (resp. 0(1) if d = X) as a subgroup of Qa (denoted R n- Gr), a distinguished 
vector x G K d with \x\ = 1, and a distinguished effect M G Ea, < M. < 14 A , will be called a direction bit for spatial 
dimension d, if the following conditions are satisfied: 

• For R G SO(d) (resp. 0(1) if d = I), the effect Ai y := A4 o Gr-i depends only on y := Rx. 



19 



• Assumption 1: There exists ui £ Qa with L x (uj) > L v (uj) for all y ^ x, where L y := M y — M- y . 

• Assumption 2: Suppose that uj. uj' € Qa are states with the property that the maps y M- L y (uj) and y h-> L y (uj') 
both have a unique maximum in the same direction yo, and the maximal value is the same: L yo (u>) = L yo (co'). 
Then uj — uj' . 

• Assumption 3: There exists a locally tomographic composite AB of A and B :— A with the property that Qab 
contains a one-parameter subgroup {G AB } t ^«_ for which there exists f 6 K such that G AB cannot be written in 
the form G A ® G B with G A G Q A and G B £ Gb- 

Now the claims of the main text will be proven in detail. 

Lemma 15. Let A be a direction bit for spatial dimension d, with distinguished direction x £ R d . Then there is a 
constant < a < 1 which we call visibility parameter with the following property: 



a = m&x{L x (uj) \ uj £ Qa, L x (u>) > L y (u>) for all y ^ x}. 



Moreover, for every y £ R d with \y\ = 1, there is a unique state ui y £ Qa such that L y (ui y ) = a and L z (ui y ) < a for 
all z 7^ y. Furthermore, Gs^Jy — ojs v and M y o Gs = Ms~i y f or all S £ SO(d) (resp. S £ O(l) if d = 1), and the 
maps y t— > ui y and R i— > G R are both homeomorphisms into their images (in the subspace topology). 

Proof. In all of this proof, if d — 1, then all appearances of SO(d) shall be replaced by 0(1). Let x £ R d be the 



direction bit's distinguished direction (cf. Definition 14), and M = M x the distinguished effect. Let uj £ Qa be 
any state with L x (uj) > L y {ui) for all y ^ x (it follows that L x (uj) > 0). Let uj' £ Qa be any other state satisfying 
L x (uj') > Ly{uj') for all y ^ x and at the same time L x {uj') > L x (uj) (if no such state exists, we are done: just set 
a := L x (uj) and ui x :— uj). Define the state /i := f Resold) GuuidR. By invariance of the Haar measure, there is 
a constant (3 > such that A4 v (fi) = (3 for all y, and thus L y (/i) = 0. Set A := 1 — L x (uS) / L x (ui') £ (0, 1), and 
(p := Xfi + (1 — A)cj', then by construction L x (ip) > L y (tp) for all y ^ x, and L x (ip) = L x {ui). Thus, Assumption 2 
implies that ui = ip = A/i + (1 — A)ui' . In summary, all states that have x £ M. d as their unique maximizing direction 
of L. lie on the line which starts at /i and extends through u to infinity. 

Since the state space is compact, this line will hit the topological boundary of Qa m some state that we call ui x . By 
construction, there is some A £ [0, 1) such that u> = A/i + (1 — A)^. But then, L x {u>) > L y {ui) for all y ^= x implies the 
analogous strict inequality for uj x . Set a :— L x (ui x ), then it has the claimed property. For every y £ K d with \y\ = 1, 
choose some R £ SO(d) with Rx = y, and set uj y :— Gruj x . Since L y — L x o Gr-i, we have 

L y (ui y ) = L x (G R -iui y ) — L x (uj x ) = a. 

Let z ^ y be an arbitrary vector with \z\ = 1, and let S £ SO(d) be any transformation with Sx = z. Then z ^ Rx, 
hence R~ x Sx ^ x, and so 

L z (u y ) — Ls x {Gruj x ) — L r -i Sx (uj x ) < a. 



It follows directly from Assumption 2 that ui y is the unique state with these two properties. Recalling Definition 14 
we also have 



My o G s = M o G R -i oG s = Mo G R -i s = M o G {s -i R) -i = M s -i Rx = M s -i y . 

A simple calculation also shows that uj := Gsuj y has the properties Ls y {ui) — a and L z {ui) < a for all z ^ Sy, which 
shows that ui = u>g y . Next we show that the map y h- > ui y is continuous. To this end, let {y n }neN be a sequence of 
vectors in K d with \y n \ = 1 which converges to some vector y. Clearly, we can find a sequence of orthogonal linear 
maps {-RnjneN with R n y n = y and R n '"—^ i. By continuity of the group representation, we have G Rn '"—^ i ) anc l 
thus 

\ u v - w y„l = \ u R n y n - ^yJ = \Gn n u yn - uj Vn \ < \G Rn - • \uj y J 

since the state space is compact. Since ui y ^ ui z for y ^ z, the map y i— > ui y is a continuous injective map from the 
compact unit sphere in W 1 to its image. Thus |63l 164] . it is a homeomorphism into its image. 

Similarly, the calculations above show that R^ S implies that G R ^ Gs- Since the map R i-> G R is continuous, it 
is a homeomorphism into its image. □ 

The next lemma also serves as a definition of the maximally mixed state. 
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Lemma 16. Let A be a direction bit for d spatial dimensions. Fix any x £ K with \x\ = 1, and define the maximally 
mixed state by integration over the Haar measure of SO{d) (resp. O(l) if d — I), 



H := I G R uj x dR = / 0J Rx dR. 

JR^SO(d) JR^SO(d) 

The resulting state fj, does not depend on the choice of x. Moreover, there is a constant < c < 1 such that Ai x (/i) = c 
for all x £~R d with \x\ = 1, and G r ^l = /i for all R £ SO(d) (resp. R £ 0(1) if d = 1). We call c the noise parameter 
of the direction bit A. 

Proof. If d > 2, it follows from G R ui y = uj Ry that the definition of [i does not depend on the choice of x. The identity 
Gr/j, = n follows from the invariance of the Haar measure. Set c := M x (fi), and let y £ M. d be any vector with \y\ = 1. 
Then there is S € SO(d) with y = Sx, thus M y = Ms x = M x o G s -i, and so 

M v (n)= M x o G s -i (uj Rx )dR = M x \ u] s -i Rx dR = M x (p) = c 

JReSO(d) JR£SO(d) 

again by invariance of the Haar measure. For the case d = 1, replace all appearances of SO(d) in the proof by 
0(1). ' □ 

Lemma 17. Every state uj of a direction bit can be written in the form uj = Xuj x + (1 — X)/i, with < A < 1, some 
direction x with \x\ = 1, and /i the maximally mixed state. 

Proof. Let uj be an arbitrary direction bit state. By compactness of the unit sphere and continuity, there exists x £ M. d , 
\x\ = 1 such that L x (ui) > L y (uj) for all y £ M. d with \y\ = 1 (there may be several maximizers x; we choose one of 
them arbitrarily). For < e < 1, define oj e := (1 — e)oj + eu x . Clearly L x (lj s ) > L y (u> E ) for all y ^ x. According 



to Lemma 15 we have a = L x (u) x ) > L x (uj e ), and thus a > L x (uj) by continuity. Define uj' e := \ e uj x + (1 — A e )yu, 
where X e := e + (1 — e)L x (ui)/a £ (0, 1]. It follows that L x (uj' £ ) > L y (uj' e ) for all y ^ x. Moreover, L x (fi) = implies 
L x {u>' e ) = L x (uj e ), so Assumption 2 proves that u>' E — u e . Since A := lim^o A £ exists (and equals L x (uj)/a), we can 
take the limit e — > of this equation and obtain uj — Xui x + (1 — X)fi. □ 

Lemma 18. Every direction bit state space for spatial dimension d is a d-dimensional unit ball. 

In more detail, every direction bit state space for spatial dimension d is equivalent to the dynamical state space 
(B,B + ,U b ,E b ,Gb), where 

• B = R d+1 , 

• Q,b = {(lj r ) T I r G \r\ < 1}, i.e. Hb is a Euclidean unit ball of dimension d, 



U B (x Q ,xi, . . . ,x d ) = x , 

• we have SO(d) C Q B (resp. O(l) C Qb if d = I), where the inclusion means that a rotation R £ SO(d) (resp. 
R £ 0(1) if d = 1) acts on states as G R (l,r) T — (1,Grx) t , where G R = ORO -1 for some fixed orthogonal 
matrix O that does not depend on R, 

• the set of allowed effects Eb contains the "noisy spin measurements" M x {ui) — c + (a/2)(uj x ,uj) , where x £ R d 
with \x\ = 1 is a fixed unit vector, u) denotes the vector corresponding to the state ui via uj = (1,uj) t , uj x is the 
analogous vector corresponding to the state uj x and satisfying \uj x \ — 1, while a and c are the visibility and noise 
parameters. 

Proof. If d = 1, then replace all appearances of SO(d) in this proof by 0(1). Let (A, A +1 U A , Ea, Qa) be a direction 
bit for spatial dimension d. Let D := dim A — 1. We can reparametrize the normalized state space Ha by an affine 
map <f> : A 1 —> M. D , where A 1 := {x £ A \ U A {x) = 1} is the affine hyperplane that contains the normalized states. 
We define by first setting M(oS) := uj - fx, and G R := M o G R o M _1 for R £ SO(d). Both M and all G R are 
affine maps; moreover, G^(0) = 0, hence G R is a linear map, or a D x D-matrix. Define the positive matrix X > 
by X := f ReSO(d) G^G R dR, then G^XG S = X for all S £ SO(d). Now we set <j>{u) := aVx(uj - fjt), where a > 

is a constant that we will determine later. Let (•, •) denote the standard inner product on M. D , and let x and y be 
directions, and R £ SO(d) a rotation with y = Rx. Abbreviate uj := 4>{uj). Then 

2/./vc . ,,\ ./yi, . ,.\\ 2 



ujy,uj y ) = a {VX(uj y - fj,) , w X {uj y - fj,)) = ct (uj y - n,X(uj y - (j,)) = a (G r {uj x - h),XG r (uj x - //)) = (uj x ,uj x ) 
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Hence, by choosing a > appropriately, we achieve that the Euclidean norm satisfies \ui x \ — 1 for all directions x. 
Now we define a linear map L : A — >• M D+1 by linear extension of L(ui) :— (l,w) for all u> £ A 1 . This is clearly an 
invertible map. Set B := R D+1 , il B := LQ A , and W B := U A o L^ 1 . Then a vector x £ B satisfies U B (x) = 1 if and 
only if Z/ A (L -X (a;)) = 1, that is, iff ^(x) £ A 1 , or x G L(^4 1 ), which is equivalent to x = (xo, . . . , xd) with xo = 1. 
Thus, U B (x) — xq as claimed. 



Due to Lemma 17 every € J7b can be written in the form = AX(a; a ;) + (1 — X)L(/i) with some A € [0, 1]; 
that is, (l,w) T = A(l,w x ) T + (1 - A)(1,0) T . Thus \u)\ = X\lj x \ < 1. That is, the set of all a), i.e. A B := {a> | w € Oa} 
is a compact subset of the Euclidean unit ball in M. D (and it has full dimension D by our background assumptions). 

Suppose that jx = was a (relative) boundary point of |65j . Then there would be a non-trivial supporting 
hyperplane to fls which contains /t; that is, an affine functional / : B 1 — > K (where B 1 := {x £ B \ xq = 1}), such 
that f(fi) = and f(ui) > for all ui £ &b- If we had f(Cu) = for all £ Qb, then would be fully contained 
in the corresponding hyperplane, contradicting the fact that it has full dimension. Thus, there are states u with 
f(&) > 0. Since every u> ^ jx equals \uj x + (1 — X)fi for some A € (0, 1] and some direction x, this proves the existence 
of some directions x with f(fi x ) > 0. Thus, 



/ LbR X dR =>■ /(A) = / f(u Rx )dR > 0. 

JReSOld) JReSO(d) 



IR<£SO(d) JR<£SO(d) 

This is a contradiction. Thus, /i must be in the relative interior of £Ib- That is, there is some e > such that the e-ball 
around fi = is contained in f^. Now let v £ M. D be any vector with Euclidean norm |u| = 1. Then (e/2)v £ &b, 
that is, (s/2)v = oj for some u £ £Ia- Thus, there is some A £ [0, 1] and some direction x £ M. d such that tli = \Cj x . 
Since \ui x \ = 1, this is only possible if A = e/2 and v = Cj x . But this implies that v £ VIb- We conclude that all unit 
vectors are contained in £Ib ~ thus, by convexity, Qb is the full unit ball. Since all points on its surface are of the 
form 6j x for some direction x, Lemma [l5| implies that the map x i— > uj x is a homeomorphism from the unit sphere in 
R d to the unit sphere in R D . It follows that d — D. 

Set Qb '■= L o Q A o L~ l . Suppose d > 2. Let R H> be the distinguished continuous representation of SO(d) 



according to Definition 14 and let Gr :=Lo G^ o L 1 £ Qb- Since reversible transformations preserve the normal- 



ization, this is only possible if Gn(l,r) T = (l,Gnr) T , where every Gr preserves the Euclidean norm. The results of 
imply that the map R i— > Gr is continuous and injective, hence Qb ■— {Gr \ R £ SO(d)} is a compact 



15 



Lemma 

connected subgroup of O(d) containing the identity, hence a Lie subgroup of SO(d). Since R i— > Gr is in particular 
injective, this is only possible if Gb = SO(d). In other words, the map R >-> Gr is a continuous group automorphism 



of SO(d). According to Lemma 23 below, there exists an orthogonal matrix O £ O(d) such that Gr = ORO . 
On the other hand, if d = 1, then every Gb & Qb preserves the set of pure states {(1, 1) T , (1, — 1) T } as well as the 



normalization. This is only possible if Gi(l,r) T = (l,r) T and G_i(l,r) T = (1,— r) T . 

Let x be the direction bit's distinguished direction as given in Definition |14| and M. x the corresponding allowed 
effect. Let Eb '■= Ea ° L" 1 , and in particular £ x := A4 X o L^ 1 . By linearity, we can write £ x in the form A4 x (co) = 
£ X (1,U)) T = a + f3(v x ,uj), where a £ R, (3 > 0, and v x £ R d is some unit vector. First, M x (fi) = c and p, = 
implies a = c. For every rotation R £ SO(d), acting on direction bit states via Gr, denote by Gr the corresponding 
transformation in the ball picture, i.e. 

L o Gr o L _1 (l,tj) = (1, Gruj) for all cu £ Q A . 

We know that Gr £ SO(d), too. For arbitrary directions y £ ]R d , \y\ = 1, choose R £ SO(d) with Rx — y, then 

M y (u) =M Rx (u) = M x oG R -x(w) =c + /3(v x ,G R -iu) = c + fi(G R v x ,Cj). (A6) 

If d = 1, we have G_i = -1; if d > 2, this follows from G_i = 0(-l)0" 1 = -1. Thus 

L x (u) y ) = M x (oj y ) - M-i. x (u y ) = M x (oJy) - M x o G_i(cj y ) = (3 (v x , u y ) - (3{v x , G-xCby) = 2/3(V X ,LOy). 

This expression attains its maximum in y for y — x, thus v x = to x . It follows that a = L x (lo x ) = 2f3(Ca x ,uj x ) = 2/3, 



hence /3 — a/2, and we obtain A4 x (cu) = c + {a/2)(uj x , uj). Due to eq. (A6), the analogous equation holds true for 
all other directions 5/1. Thus, we know that all these M. y must be allowed effects, i.e. elements of Eb- In the 
following, we always assume that we have chosen the ball representation right from the start, such that £ x = M. x . □ 
Lemma [18] implies that direction bits have at most two perfectly distinguishable states in their state space, and 
not more. This justifies the name "direction bits". In more detail, if A is any state space, call a set of states 
u>i, . . . ,oj n £ fiyi perfectly distinguishable if there are effects £\, . . . ,£ n £ E A with £ % + . . . + £ n = U A such that 
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£i(uij) = Si.j, that is 1 if i — j and otherwise. The maximal number of any set of perfectly distinguishable states 
will be called the capacity Na [18]. In the special case of a quantum system, Na equals the system's Hilbert space 
dimension. The following lemma is well-known in the context of general probabilistic theories; we give the proof 
for completeness. It says that ball state spaces of any dimension d are bits, i.e. have capacity N — 2; this includes 
classical bits (d = 1) and quantum bits {d = 3) as special cases. 

Lemma 19. If A is a Euclidean ball state space with all effects allowed, i.e. 

A = R d+1 , n A = {{l,rf \r&R d , \r\ < 1} , E A = E% ax = {M g A* + | < M(u) < 1 for all u G Sl A }, 
then it has capacity Na = 2, i.e. it is a generalized bit. 

Proof. Let r € R d be any unit vector, \r\ = 1. Set cjj := (l,r) T g Ha and uj 2 := (1,— r) T g f^, then the two 
functionals 

fiM ~ ^MV), £a(w) := \{{h~r) T M 

are effects in Ea that perfectly distinguish uj\ and and sum up to U A . Thus Na > 2. 

Suppose there are n > 3 perfectly distinguishable states Wi, . .. ,u> n g fi^, with corresponding effects £\, . . . ,£ n . 
Consider the hyperplane H :— {x £ R d+1 \ £\(x) = 0}; it is a support hyperplane [65] of Qa- Furthermore, since 
u>2, • ■ ■ , w n g H, it contains more than one point of f^, so PI is a face of that contains more than one point. 
However, all faces of Euclidean balls contain only one point; we obtain a contradiction. Hence Na < 2. □ 



According to Lemma 18 direction bit state spaces are Euclidean balls. In the case that all effects are allowed 
(which, as we show later, corresponds to the "noiseless" case with visibility and noise parameters a = 1 and c = 1/2), 
they have therefore capacity N — 2, i.e. they are in fact bits as the name suggests. If not all effects are allowed, then 
direction bits are noisy versions of bits (formally they have capacity N = 1). Thus, in contrast to von Weizsacker [T], 
we do not assume from the beginning that our physical systems under consideration are 2-level systems, but we prove 
this from the postulates. 

Corollary 20. Given the noise parameter < c < 1 of any direction bit, the intensity parameter a satisfies < a < 
2 min{c, 1 — c}, and the spin measurements satisfy M x (cu) e [c — a/2, c + a/2] for all \x\ — 1 and us € Q.A- 



Proof. We know that a > due to Lemma 15 In A4 x (lo) = c + (a/2)(uj x , uj), the inner product can attain any value 
in the interval [—1, 1] by choosing Co in the unit ball appropriately. But M x (ui) is an outcome probability, hence in 
the interval [0, 1]. Working out the corresponding inequalities proves the claimed constraint on a. □ 
Now that we know that direction bit state spaces are unit balls, we can say a bit more on the set of possible 
protocols satisfying Postulates 1 and 2. Surprisingly, dimension d — 2 turns out to be special, as illustrated in Fig. [8] 
below. 

Lemma 21. Consider any protocol satisfying Postulates 1 and 2, under the additional requirement that every state 
ui 7^ /i may be used to encode some direction x(oj) g M. d , \x(oj) \ — 1, such that Bob's decoding ui i— > x(u>) is a continuous 
map. If d 2 then there is an orthogonal matrix O G O(d) such that 

that is, up to a fixed rotation (and possibly reflection), physical directions are encoded into Bloch vectors that point 
into the corresponding direction in state space. In particular, for d ^= 2, if uj and tp encode the same physical direction, 
then there is X g [0, 1] such that uj = \tp + (1 — A)/i or vice versa (i.e. with u> and tp exchanged) - that is, one of the 
states is obtained from the other by adding uniform noise. 

Proof. Suppose that d = 1. Then every state u> ^ /x has one-dimensional "Bloch vector" w 6 [— 1, 1] \ {0}. There are 
two possible directions, +1 and —1, which have to be encoded in accordance with Postulate 1. If this is done in a 
continuous way, the only two possibilities are 



x(u>) 



±1 if Cj > 0, 
Tl if u> < 0, 



proving the claim. Now consider the cases d > 3. For every x g S 1 , define the stabilizer subgroup Q x := {R € 
SO(d) | Rx — x}. Let w / /i be an arbitrary state. From the main text, we know that R g Q x (u) implies that 
Grlo = uj, hence GrCj = uj = 0~ 1 ROuj with some orthogonal matrix O, so R € Goti- We get G x r u ) C Goa- Since 
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FIG. 8: In d — 2 spatial dimensions, Alice and Bob may use a protocol that is unavailable in other dimensions: they may 
agree that Bob decodes mixed states with a purity-dependent rotation. That is, if Bob obtains many copies of some state uj 
and determines r := \il>\, his output will be R r x, where x is the direction encoded in the pure state with Bloch vector to/r, and 
R r S SO(d) is a rotation depending on r. The figure shows possible level sets of states in the disc state space that encode the 
same spatial direction. This strategy is impossible in higher dimensions, because without any shared reference frame, Alice 
and Bob will not be able to agree on a 2-dimensional reference subspace which carries the corresponding rotation. As a result, 
the level sets must be straight lines for d > 3, as proven in Lemma [2l| 



both groups are isomorphic to SO(d — 1) this implies equality. Since d > 3, this in turn implies that x(uj) is parallel to 
Ouj; thus, there is a sign a(uj) £ {— 1, +1} such that x(ui) — ct(uj)Ouj/\uj\, and the sign (plus or minus) cannot depend 
on uj due to continuity of uj t— > x(cj). This proves the claim after possibly redefining O > (—0). □ 
We now prove a technical lemma which is related to the claim in the main text that the angles inferred from state 
space must agree with those in physical space (discussed in more detail in Appendix [C] below). We show that the 
map x i — y &x which maps direction vectors x £ M. d , |x| = 1, to pure states' Bloch vectors uj x is linear: there is some 
orthogonal matrix O £ 0(d) such that uj x = Ox. This follows from the following lemma: 

Lemma 22. Let R i— > Gr be a continuous group automorphism of SO{d), and suppose x i— > Cj x is a continuous map 
of the unit sphere S 11-1 :— {y £ M. d \ \y\ = 1} to itself such that Gruj x = ujr x . Then there is an orthogonal matrix 
O £ 0(d) such that Qj x = Ox. 



Proof. According to Lemma 23 there is an orthogonal matrix O £ 0(d) with Gr — ORO^ 1 for all R £ SO(d). The 
lemma will be proven by distinguishing several cases. 

First, consider the case that d is odd. Let x £ K d , |x| = 1 be arbitrary, then there is some R £ SO(d) for which the 
multiples of x are the only eigenvectors of eigenvalue 1, i.e. Ry = y with y £ M. d is equivalent to y — ax with a£l. 
But then 

ORO~ x Gj x = G r oj x = Gj Rx =Cj x R^^ljx) = 0~ x Qj x , 

and so 0^ 1 uj x £ {— x,x}. Since this is true for every direction x, and the map x t— > ui x is continuous, we either have 
Gj x = Ox for all directions x (in which case the lemma is proven), or u> x = —Ox for all directions x, in which case we 
can replace O by (—0) and obtain the statement of the lemma as well. 
Next, consider the case d = 2. For every x £ K d with \x\ — 1, define 

R x := ( Xl ~ X2 \ £ 50(2). 

We have Qj x — G^iin QW = OR x O~ x Cj(Y Qy a ma P x ^ &xi this is manifestly linear. Since it preserves the 
Euclidean norm, it must be orthogonal. 

Finally, consider the cases of even d > 4. Let S C M. d be any 2-dimensional subspace. Suppose that x £ S. Clearly, 
there is R £ SO(d) which acts as the identity on S (and nowhere else), i.e. y £ S Ry — y. But x £ S, hence 
Rx — x which implies GrCj x = Cjr x = uj x , so i?(0 _1 w 2 ;) = 0~ x uj x , and thus 0^ 1 cu x £ S. 

Now let S, S' be two 2-dimensional subspaces with S fl <S" = spanjx}. Since x £ S and x £ S' , we have O~ x 0j x £ 
SdS' , so 0~ x Gj x £ {— x, x}. Similarly as above, we conclude that either 0~ 1 £j x = x for all directions x, or 0~ x Gj x = —x 
for all x, in which case we can redefine O n- (— O). □ 

Lemma 23. All continuous automorphisms of the special orthogonal groups <f> : SO(d) — > SO(d) are of the form 
(l):g^ PgP^ 1 with P £ 0(d). 
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Proof. Since every continuous homomorphism of Lie groups is analytic 66J, <f> induces an automorphism on the Lie 
algebra so(d), uniquely determined by its action on the neighborhood of the identity. But not every automorphism 
of a Lie algebra q necessarily induces an automorphism on the corresponding group G. Rcf. [67J contains the auto- 
morphisms of the Lie algebras so(d), and in what follows, we figure out which of these correspond to automorphisms 
of 50(d). 

One particular type of automorphisms for both, G and g, are conjugations by group elements, that is I 4 
gXg -1 where g E G. These are called inner automorphisms. Proposition D.40 from [67: (page 498) tells us that all 
automorphisms of a Lie algebra g are generated by the inner automorphisms times the symmetries of the associated 
Dynkin diagram. The Dynkin diagram of so(2n + 1) has no symmetries, hence all the corresponding automorphisms 
are inner. This proves the lemma for odd dimension. 

Exercise 22.25 in [67] (page 362 with answer on page 529) states that for n > 5, the symmetry of the Dynkin 
diagram of so(2n) is implemented by a conjugation X H> PXP^ 1 , with P 6 0(2n). This proves the lemma for even 
dimension d > 10. In what follows we consider separately the cases d — 2, 4, 6, 8. 

Case d = 2. The Lie algebra so (2) is a one-dimensional real vector space with trivial commutator. Hence, the 
automorphisms are X H ► aX for any real a. It is easy to see that among these, the only ones which induce an 
automorphism in SO(2) are the identity and a = — 1. The second one can be implemented as 

1 \ / 1 



I -i) 9 \o 1 

Case d = 4. In page 274 of Ref. [67], it is shown that so (4) = su(2) © su(2) = so(3) © so(3). Hence, all the 
automorphisms of so (4) are those of so (3), which as shown above are inner, together with the exchange of the two 
summands in so(3) ©so(3), which can also be implemented by conjugation. 

Case d — 6. The standard representation of 50(6) is equivalent to the antisymmetric product of two copies 
of the standard representation of SU(4) (see page 284 in [S7J), which is irreducible. This also implies that this 
representation of 5C/(4) is real, and hence, equivalent to its dual (or complex conjugated) representation (see page 
218 in [67]). Exercise 22.25 in 67 (page 362 with answer on page 529) shows that the symmetries of the Dynkin 
diagram of SU(A) are implemented by complex conjugation X H> X* . Since the representation of 5t/(4) equivalent 
to the standard representation of 50(6) is real, complex conjugation leaves the algebra and the group invariant. So, 
the only automorphisms of so(6) and 5*0(6) are inner. 

Case d — 8. In this case the Dynkin diagram has the larger symmetry called triality. Section 20.3 in [67] shows that 
this symmetry permutes the defining representation of so (8) and the two fundamental spin representations S + , S~ . 
This cannot be a symmetry of 50(8), since the exponentiation of 5 + or S~ gives the group Spin(8), which is different 
from 50(8). So the only nontrivial automorphisms of 50(8) are inner. □ 

We will now show that it is sufficient to consider "optimal" direction bits, i.e. ones with visibility parameter a = 1 
and noise parameter c = 1/2. The idea is to take the state space A of a direction bit with c ^ 1/2 and/or a < 1 and 
to modify it by allowing all effects A* + . The bipartite state space of two modified direction bits is then defined as the 
orbit of Gab on the product states and effects. However, it has to be shown that this results in a valid state space; 
in particular, all probabilities must be positive. This is shown in the following lemma. It uses the definition of i?™ ax 



as given in eq. ( Al I 



Lemma 24. Suppose that A = (A, A + ,U A , Ea,Ga) is a direction bit for spatial dimension d with arbitrary visibility 
and noise parameters a and c, with joint state space for two direction bits AB = (A®B, (AB) + ,U A ®1A B , Eab, Gab)- 
Then 

A' := (A,A+,U A ,ET x ,Ga) 

is a direction bit for spatial dimension d with visibility parameter a' — 1 and noise parameter c' — 1/2, with a possible 
state space of two direction bits given by 

A'B' = {A®B, Gab(A+ ® B + )M A ®U B , (ET X ® E^) ° Gab, Gab), 

where Gab{A+ ® B + ) is the convex hull of all unnormalized states of the form G(lu a ® uj b ) with G € Gab an d 
lo a G A+, uj b G B+, while (£^ ax ® E™ ax ) o Gab is the convex hull of all effects of the form (A4 <Ei J\f) o G with 
M G £^| lax , M G E l g ayi , and G G Gab- 

Proof. Throughout the proof, if d = 1, replace SO(d) by O(l). Clearly, A' is a valid state space. We know that to 
every direction x G M d , there is a state ui x G £Ia such that M. x {w) = c+ (a/2)(uj x ,CS) for all ui G Qa- The linear map 
A4' x (uj) := 4 + ^(uj x ,<2i) is in [0, 1] for all uj G Ha, hence contained in i?™ ax = Ea>- It is easy to check that 

M' x = -M x +(\--)u A . 
a \ 2 a I 
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Let Ai be the effect from Definition [TIJ and Ai 1 the effect on A' related to it by the previous equation. Using the 
notation from Definition 14 where R G SO(d) is a rotation with y = Rx, we have 



M' o G R -i = -Ai o G R -i 
a 



U A o 



G R -1 = -My 

a 



U A =M'„. 



Thus, the prerequisites and Assumptions 1 and 2 from Definition |14| are satisfied for A'. In order to show that 
Assumption 3 holds true, we have to prove that A'B' is a valid composite of A' and B' . 

Clearly, (A'B') + = Gab(A + ® B + ) is a closed convex cone, and (A'B') + C [AB) + implies that Qa'B' — W G 
(A'B') + | U ab {oj) = 1} C is closed. Since Qab is compact, so must be Qa'B'- Since A + ® _B + spans the full 
space A ® B, so does (A'£>') + . This shows that (A'B') + is a proper cone. 

According to Definition II all that remains to do is to show that (E A ax ® E^ ax ) o Q AB is a valid choice for E a *b' ■ 
Clearly this is a closed convex set spanning A® B. It remains to show that all its elements are non- negative and no 
larger than one on VLa'b 1 ', by convexity, we only have to show this for the elements {Ai ®Af) o G, where Ai E i?™ ax , 
Af E E B la * and G G Gab ■ Finally, convexity for the state cone additionally implies that it is sufficient to prove that 



< Ai ® Af (G(u A ® u B )) < 1 for all Ai e E^ x ,Af e E 



max , ,A 
UJ 



e n A ,u B g n fl) Geexfl. 



(A7) 



The set E A ax is easy to characterize: for every effect Ai £ E A ax , there are A, n € R and x € M d with |x| = 1 such that 
A4(oj) = A ((lo x , Cj) +1) + k for all u> G Ha- A negative sign of A can be removed by the substitution Cj x h- —uj x = &- x , 
so we may assume A > 0. Since A4(uj) 6 [0, 1] for all u E Cl A , we get < n < 1 — 2A and A < 1/2. It follows that 



Ai 



2A 



-Mn 



We can express Af E A* + in an analogous form, replacing A, k, x by A', k' , y. Since conditional states are included in 
the local state spaces by definition, eq. ( A3 ) and Corollary 20 imply for every uj ab E £Iab that 



Ai x ® Ai y {uj 



AB\ 



(-1) 2 .(-1) ! 



where uj A ond is the conditional state on A after having obtained Ai y on B, and u> B is the marginal on B. If uj a E FIa, 

G(uj a ® uj b ) i s a v alid state in SIab- Using that 



G Ob, and G G ^b, then by definition the vector lj 



ab 



AB\ 



Ai y {u) ) with w the marginal of w , the expression in eq. (A7) can be lower-bounded by 



Ai ®Af{w 



AB\ 



> 



4AA' 



All — 



2c 



2A 


A' 
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2A' 

a 



a 



= kk' > 0. 



(A8) 
(A9) 



An analogous calculation yields the upper bound Ai ® Af(uj AB ) < (2A + k)(2A' + k') < 1. We have proven that A'B' 
is, as given, a valid composite state space. Finally, we have b' = A4' x (lj x ) — A4'_ x (uj x ) = 1 and c' = Ai' x {p) = 1/2. □ 
We obtain an immediate consequence: 

Corollary 25. There is no direction bit for spatial dimension d=l. 

Proof. Suppose A = (A,A + ,U A ,E A ,g A ) is a direction bit for spatial dimension d = 1, and A' is its optimal modi- 
fication from Lemma [24], with A'B' the composite state space of two modified direction bits. We know that Qa'B' 
contains at least all combinations of product states; that is, Qa'B' C CIa'B', where 



<S> A , 



1 

±1 



1 

±1 




On the other hand, E A > R > contains all product effects. That is, if uj a B E CIa'B', then 

.A'B' 



1/2 
±1/2 



1/2 
±1/2 



> 



2G 



for all possible choices of signs. It is easy to see that this is only possible of ui A B £ & A B : the four inequalities 
give the half-space representation [68] of the tetrahedron $^4/ b> ■ It follows that Q,a> b< Q $a> b< , and thus equality of 
these sets: the state space for two modified direction bits is a tetrahedron, that is, a classical four-level system. It 
has only finitely many (four) pure states; thus, Ga' B' — Gab must be a finite group. This contradicts Assumption 3 
on direction bits. □ 
All dimensions d > 2 for the ideal case a = 1 and c — 1/2 have been examined in [54]: there it is shown that for 
d 7^ 3, all possible composites of d-dimensional ball state spaces have transformation groups that are non-interacting, 
therefore contradicting Postulate 4 resp. Assumption 3 in the definition of direction bits. (In Appendix [Dj we give 



a simplification of a key lemma in [54] for the special case of this paper.) Lemma 24 extends this result to noisy 
direction bits with c 7^ 1/2 and/or a < 1: 

Theorem 26. There are no direction bits for spatial dimensions d 7^ 3. 

As mentioned in [S3] , we prove in [55] that the only possible composite of two noiseless 3-dimensional ball state 



spaces (up to equivalence), under the assumptions of Definition 11 is quantum theory on two qubits. Now we show 



that this extends to noisy 3-dimensional balls, with the only difference that the set of effects might get reduced: 

Theorem 27. Every direction bit for spatial dimension d — 3 (regardless of visibility and noise parameters a and c) 
can be represented as a "noisy qubit" A = (A,A + ,U a ,E a ,Ga), where 

• A is the real vector space of Hermitian 2x2 matrices, 

• A + is the set of positive semidefinite complex 2x2 matrices, 

• the unit functional U A is the map p 1— ► tr(p), 

• Ea is a subset of the quantum effects, containing all maps of the form p t-¥ tr (pM) , with M a positive semidefinite 
2x2 matrix satisfying tr(M) = 2c and the operator inequality M < (c + a/2) • 1, 

• Qa is the projective unitary group, p 1— > V pW with U £ SU{2). 

The joint state space of two direction bits is then by necessity AB = (A® B, (AB) + ,Li A <S> U B ', Eab, Gab), where 

• A® B is the real vector space of Hermitian 4x4 matrices, 

• ( AB) + is the set of positive semidefinite 4x4 matrices, 

• the unit functional U AB = U A (&U B is the map p 1— > tr(p), 

• Eab is some subset of the quantum effects p t— > tr(pM) with M a positive semidefinite 4x4 matrix, < M < 1, 

• Gab is the projective unitary group, p 1— > UpU 1 ' with U £ 5'/7(4). 

Proof. The standard Bloch representation of a qubit shows that the vector space A as well as A + and U A can be 
chosen in the claimed form. In the ball representation, it is clear that the only possibilities for Ga are SO{2>) and 



0(3). The noiseless version A! from Lemma 24 will have the same transformation group. However, it is shown in [53] 
that 0(3) is impossible if we want to construct a global state space A'B' with interaction out of noiseless 3-balls (in 
a nutshell, the group 0(3) would introduce partial transpositions on A'B' which yield negative probabilities). Thus, 
the group must be «SO(3), which (in the chosen representation) is the projective unitary group. 

It is easy to confirm that the special effects Ai x , x £ M 3 , \x\ = 1, can be represented in the following way: for every 
x, there is a complex unit vector \ip x ) £ C 2 with 

M x (p) = tr(pM), where M = a\<p x ){(p x \ + (c - |) 1. 

These matrices have trace tr(M) = 2c. The convex hull of all these matrices is a subset of Ea- 

|m tr(M) = 2c, M <(c+|)-l}c E A - 

As it has been shown in [5S], the noiseless composite state space A'B' equals quantum theory on two qubits. Since 



Ga 1 b' = Gab in the construction of Lemma 24 it follows that Gab must be the projective unitary group as claimed 



(that the vector space is AB = A® B and U AB — U A (g> U B follows directly from the definition of a composite, 



Definition 111. Since the unitary group generates the set of all quantum states from a pure product state, it follows 
that the quantum state space of two qubits, Vfi := {p £ A <E) B \ tr(p) = l,p > 0}, is contained in Qab- Suppose 
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there was any a G Qab \ , then this would be a Hermitian matrix with at least one negative eigenvalue. Using an 
appropriate unitary U, this matrix can be diagonalized and be brought into the form U aW = J ^2 i J=0 Xi.j \i) (i\ ® \j) (j\ 
with Ao,o < 0, denoting by {|0),|1)} a basis of C 2 . Using t he li near functional M(p) := N(p) := (0|p|0), we get 
M. ® j\f(UaW) = A 0j o < 0. However, this contradicts ineq. (A9), which shows that all noiseless product quantum 
measurements M. ® M ovl all bipartite states uj ab G Qab must yield positive probabilities. Therefore Qab = , 
hence (AB) + is the set of positive semidcfmitc (4 x 4)-matrices. 

We do not really know what Eab is: since all its elements must be non-negative on all quantum states, it must 
be a subset of the quantum effects. Since there are no further conditions on Eab in Definition [IT] it could possibly 
coincide with the set of quantum effects, or be a proper subset. All we know is that it contains the unitary orbit of 
all allowed product effects. □ 



Appendix B: Proof of non-existence of frame bits (in a special case) 

We now prove the claim made in Section IV in the main text. Recall the "frame bit" setup as explained in Fig. [6] 
We start by giving a formal definition of a frame bit, modifying and specializing Definition [14] for direction bits. 

Definition 28 (Frame bits). For d > 2, a dynamical state space (A,A + ,L{ a ,Ea,Ga) together with a distinguished 
continuous representation of SO(d) as a subgroup of Qa (denoted R i— > Gr), and a distinguished effect M. G Ea, will 
be called a frame bit for spatial dimension d, if the following conditions are satisfied: 

• For all Y G SO{d), define My ■= M o G Y -i. 

• Assumption 1 ': There exists uj G VLa with A4i(uj) > Aiy(uj) for all Y ^ 1. 

• Assumption 2': Suppose that uj =/= uj' G Qa are states with the property that the maps Y i— > M.y{uj) and 
Y i— > M.y(uj') both have a unique maximum for the same frame Yq. Then there exist < Ay < 1, Y]j Xj = 1, 
and pairwise distinct rotations Rj G SO(d) such that 



My{u') = X 3 M R]Y {uj) for all Y G SO{d) 



j 

or vice versa (that is, with uj and u' interchanged). 

Note that Assumption 1' implies that for every X G SO(d) there exists tp G £Ia with Aix(f) > ■M.y( ( p) f° r au 
Y X. This follows by setting tp :— Gxoj and using A4y ° G^-i = M.xy- 
In analogy to Lemma [7] for direction bits, we can prove the following: 

Lemma 29. Under the premises of Assumption 2' in Definition\28l we obtain 

uj' =^2XjG R jiUJ. 

3 

Proof. Let tp := V ■ XjG R -iuj. Direct calculation shows that My(<p) = My{u}') for all Y G SO(d), hence Y n- My(ip) 

also has a unique maximum in orientation Yq. Denote the maximal value by m := M.y (uj') — A4y (ip). Suppose that 
ip =/= uj' . Then Assumption 2' implies that one of the following two cases must be true: 

1. We have M.y{uj') = J2j t L j-^s j Y( l p) for all Y, where < pj < 1, J2j f-j = 1> an d Sj are pairwise distinct 
rotations. 

In this case, we obtain 

m = M Ya (uj') = fJ>j-Msj Yo (<p) = ^ M SjY (t«/) ■ 

3 j V ~? ' 

<m 

This is only possible if AisjY {^') = m for ah j- By the unique maximum assumption, this implies that 
SjYo = Yq for all j, hence all Sj — 1 are equal, which is a contradiction. 

2. We have M.y(<p) = J2j f^j^S-jY^') for ah Y, where < pj < 1, J2j Pj — 1, and Sj are pairwise distinct 
rotations. Then the argument is completely analogous to case 1. 
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This proves the claim. □ 
In the following, it will turn out to be useful to introduce some abbreviations. Call any state u> with the property 
Mx{u) > My(u) for all Y 7^ X a codeword for X. Furthermore, for every state u, define 

A(uj) :— max My(^) ~ min My(^) ='■ A4 maK (w) — M m i n (u). 
Yeso(d) Yeso(d) 

Note that A is continuous, but in general non-linear. Clearly A(Grlo) — A(lo) for all R £ SO(d). Furthermore, we 
have 

Lemma 30. The map A is convex, i.e. if LOi £ and < A^, Aj = 1, then 

\iA(ui). 



IfO < Xi for all i, we have equality if and only if there exists orientations Y, Z £ SO(d) such that Ai m a,x(ui) = M-y{^i) 
for all i, and A4 m i n (^>i) = -Mz(^i) for all i. 

Proof. Let ip := J^i let Y be some frame with A / i ma x(V') = My(^), and let Z be some frame with A^ m i n (V-') — 
Mzh)- Then 

^2 KA(uj t ) = ^2 (X i M ma . x (u} i ) - XiMmmi^i)) > ^2 (AiMy(wi) - XiM z (u)i)) 



My (E A ^<) ~ M z (E A * 



ujij = M maK (i/j) - M min (ip) = A(»- 

Assume now A^ > for all i. Inspecting the single inequality in the chain above proves the claimed condition for 
equality. □ 
Assumptions 1' and 2' imply the following: 

Lemma 31. Suppose that lo and lo' are both codewords for the same X € SO(d), and A(lo) — A(uj'). Then lo = lo' . 
Proof. Suppose that u) 7^ lo' . Then Assumption 2' implies that lo' = XjG r -ilo for < Xj < 1, Xj — 1 and 



rotations Rj £ SO(d) with Rj 7^ Rp. for j 7^ k (if vice versa, rename whw'). Using Lemma 30 we get 

a(o/) < J2 x ^( g b. 7 ^) = E A ^ A ( w ) = a m- 

j 3 

with equality if and only if there exists Y £ SO(d) with Ad max (G r -ilo) — A4y(G „-iw) and there exists Z € SO(d) 

j j 

with A / i m in(Gp-ia;) = Mz{G B -iu) for all j. Since we have equality by assumption, it follows that 

j 'j 

M max (u) = M max (G R -iu) = My(G„-iw) = Mjj y(w) for all j. 

But since lo by assumption has a unique maximizing direction, we must have RjY — RkY for all j, fc and thus Rj = Rk, 
which is a contradiction. □ 
Now we prove the existence of a unique maximally mixed state, and a bit more: 

Lemma 32. There is a unique state fi such that c AAy^) is constant in Y £ SO(d). Moreover, if lo and lo' are 
both codewords for X with A(lo') < A(lo), then lo 1 = Xlo + (1 — X)fi for X := A(a/)/A(w). 

Proof. Let lo be any AT-codeword. There is at least one "uniform noise" state fi for which A^y(/i) is constant in 
Y: it is [i := f ReS0 ^Gnio dR. Let lo' be another A-codeword with A(lo') < A(lo) (there exists at least one, for 

example lo' = aco + (1 — a)fi for < a < 1). Set A := A(u/)/A(w), and define lo" := Xlo + (1 — A)//', where // is 
any "uniform noise" state, i.e. M.y(p') is constant in Y. It follows that lo" is a codeword for X, too, and we have 



A(lo") — XA(lo) = A(lo'). Thus, Lemma 31 implies that lo" = lo' , and so lo' = Xlo + (1 — A)//. Since this equation is 
true for all uniform noise states //, they must all be equal - there is a unique state fi such that A'If(m) is constant 
in Y. ' □ 



The following lemma is the frame bit analogue of Lemma 15 
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Lemma 33. There is a constant < b < 1 which we call intensity parameter such that for all X G SO{d) 

b = max{ A(w) | lj is a codeword for X}. 

Moreover, for every X G SO(d), there is a unique codeword u>x for X such that A{ljx) — b, and we have uy = 
Gy X -iu x for all X, Y G SO(d). 

Proof. Fix any X G SO(d). Lemma |32] implies that all codewords for X lie on the line which starts at the maximally 
mixed state fx, crosses the state oj(X)^ and extends to infinity. Since the state space is compact and convex, there is 
a unique state ujx at which this line crosses the state space's boundary. By construction, it has the maximal value of 
A(w) among all codewords for X. Set b :— A(u). 

For all Y G SO(d), define uy '■= Gyx- lUJ x- It is easy to check that uy is a codeword for Y, and A(wy) = b. If 
there was any other codeword lj' y =^ lj y f° r Y with A(lj y ) > b, then lj' x := Gxy- iUJ 'y would be a codeword for X 
with A(uj' x ) > b and u' x ^ ujx, which is impossible. □ 

Now we obtain the key lemma: 

Lemma 34. Every state oj can be written in the form lj = Xljx + (1 — X)fi, where X G [0, 1] and X G SO(d). 

Proof. Let oj be any state. By continuity and compactness of SO(d), there is some X G SO(d) such that j\4x{oj) > 
j\Ay(oj) for all Y G SO(d) (in general, X is not unique - choose one maximizer arbitrarily). For < e < 1, 
set lj £ := (1 — e)lj + euix, then uj £ is a codeword for X. If A(oj) = b, then lj = Wj, and we are done due to 



Lemma 33 Otherwise, A{lj) < 6, and so A(oj £ ) < b for all s small enough since A is continuous, and Lemma 32 
implies that uj e — X e uix + (1 — X £ )pL with X e := A(Lj e )/b. We have lim e _>o A(u; £ ) = A(w), and the claim follows for 
A := lim e ^ = A(uj)/b by taking the limit e — > 0. □ 
Exactly the same argumentation as in Lemma [18] - including the introduction of "Bloch vectors" lj for states oj - 
now proves the following: 

Lemma 35. The frame bit state space is equivalent to a Euclidean D '-dimensional unit ball, and the map X i— > ljx 
is a homeomorphism of SO(d) to the unit sphere S . 

Since SO(d) is not simply connected for d > 2, but S ^ 1 is simply connected for D > 3, this is only possible if 
D = 2 and thus (from dimension counting) d = 2. But in this case, frames and directions coincide. 

Theorem 36. "Frame bit" state spaces allowing the protocol in Fig. [6j while at the same time satisfying Assumptions 
V and 2' above, do not exist - unless d — 2, where they coincide with direction bits. 



At first sight, this result may seem surprising, in particular Lemma 35 which says that the frame bit state space 
must be a Euclidean unit ball, exactly as the direction bit state space. The first obvious guess, before doing any 
calculations, would have been that the pure normalized frame states ojx with X G SO(d) can simply be parametrized 
by the matrix X as their "Bloch vector", i.e. ujx — X, similarly as uj x = x for directions x up to an orthogonal 



transformation (cf. Lemma 22 1 



We will now illustrate that this first guess does not work: it results in a state space that satisfies Assumption 1', 
but not Assumption 2', confirming Theorem |36| We only discuss the simplest non-trivial case d = 3. Surprisingly, in 
this case, it turns out that our naive guess reproduces 4-level quantum theory over the real numbers. 

Example 37. Suppose we define a state space f2 with the orthogonal matrices X G SO(3) as the pure states. As 
usual, we have to add a component for the normalization, such that £1 becomes 

Q := conv{(l, X) \ X G SO (3)}. 

The vector space that carries the cone of unnormalized states is 10-dimensional; by construction, for every X G SO(3), 
we have a pure state u>x = (1, X) G fl. Every (mixed) state oj G ft is then of the form lj = (l,Af) with M G R 3x3 
some matrix which, according to [69, Corollary 5.2], has operator norm H-M^oo < 1. We denote this state by uim- 

According to \7(J\ Proposition J^.l], the full state space Q is an orbitope which can be parametrized in the following 
way. Denote the normalized state space of 4-level quantum theory over the reals by £Iq M ; that is, 

^:={ P GM 4x4 |P>0,tr(p) = l}, 
then the state space f2 can be written in the form 

Ull + U 22 - 1133 - "44 2w 2 3 - 2ui4 2mi 3 + 2li 2 4 

M = I 2u 23 + 2u 14 un - u 22 + U33 - u 44 2w 34 - 2u i2 | , U G O 4 

2m 2 4 - 2wi3 2ui2 + 2u 34 un ^ U22 - "33 + U44 



n = < oj m 



QM 
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It is easy to check that the map U M is affine and invertible - hence our candidate fi of "frame bit" state space 
for dimension d = 3 is equivalent to A-level real quantum theory, given that we define effects and transformations 
accordingly (cf. Definition^. 

The obvious choice to set up the representation of the rotation group R i— > Gr, is via Grlox = &RX for all 
X G SO(3). Every special orthogonal matrix X G 5*0(3) has trace tr(X) G [—1,3]. Thus, the following analog of 
direction bits' noiseless spin measurements, describing the probability of the "yes" -outcome in Fig. [6| yields valid 
probabilities in the unit interval for Z G SO(3) (and then for all ujz by convexity): 

M Y {uz) = \ (tr(F T Z) + 1) (Y£ SO(3),lu z e fi). 

The expression tr(Y T Z) =: (Y, Z) is the Hilbert- Schmidt inner product between matrices; the corresponding Cauchy- 
Schwarz inequality proves that My{^z) — 1 if o,nd only if Z = Y . Thus, the protocol given in the caption of Fig. [6] 
works for this state space - that is, the state space satisfies Assumption 1 '. 

However, we will now show that Assumption 2' is violated. Consider the 4x4 positive semidefinite diagonal 
matrices U — ^diag(6, 4, 2, 1) and U' — j^diag(18, 3, 3, 16); they satisfy tr([7) = tr(t/') = 1. Thus, the corresponding 
matrices M = ygdiag(7, 3, 1) and M' — ^jdiag(l, 1, 14) correspond to valid states u>m,<^M' G f2. IfY G SO(3), then 
tr(Y T M) = ~ (7ii,i + 3^2,2 + 1^3,3) ■ But for the standard orthonormal basis {|*)}f = i, we have Y iti = (i\Y\i) < 
UN) II ' 11^1*) II = 1j with equality if and only if Y\i) = \i) . This proves that the identity matrix Y = 1 is the unique 
maximizer of the function M 1— > tr(Y T M), and thus of My 'm) ■ 

However, the same calculation applies to M' , and so we have two states u>m and ujm' that are both valid codewords 



for the frame Y = 1. Thus, according to Lemma 29, Assumption 2' implies that either lom' — X) 7 - ^jGp-iWjf, or 
withuiM andoJM' exchanged. Thus, one of the two following equations must be true: 

M' = J2 X i R J lM > or ( B1 ) 

3 

M = J2*3 R J lM '- (B2) 



For real matrices X, define the norms ||X||i := tWX T X and \\X\\ 2 := ^/tr(X T X), then ||X||i = ||iLX"||i and 



l-^lb = ll-R^lb if R is orthogonal. Now eq. (Bl) implies that \\M'\\k < for k — 1 and k — 2, while eq. FBI 

implies that \\M\\ k < \\M'\\ k for k = 1 and k = 2. However, it turns out that \\M\\i > ||M'||i, while \\M\\ 2 < \\M%. 
Thus, none of the two states is strictly noisier than the other, and Assumption 2' is violated. 



Appendix C: Inferring spatial geometry from probability measurements 

As mentioned in Section [V] in the main text, we now describe an operational procedure that allows observers to 
determine physical angles from probability measurements. 

Imagine some observer (which we call Bob) in rf-dimensional space who holds a direction bit measurement device 
as in Fig. |1J Suppose that Bob does not know how to measure lengths and angles in his local laboratory; say he does 
not have the necessary tools (rulers etc.) to accomplish this. In more detail, Bob may rotate his measurement device 
into some direction x, but he does not know what the resulting direction x actually is, or what rotation he actually 
performed. He lacks the tools to determine angles between possible settings of his device, or between the orientations 
of several different devices that he might hold. 

However, suppose that Bob has access to several direction bit preparation devices. They might just lie around in his 
lab, or they might be physical systems arising in nature, preparing systems in (generally mixed) direction bit states 
w. Again, given any of these preparations, Bob has initially no idea what the prepared state (or its Bloch vector ui) is; 
still, he may operate any of these devices as often as he likes, preparing many independent copies of the corresponding 
unknown state ui. Moreover, we assume that Bob knows the intensity and noise parameters a and c and the outcome 
io, as defined before eq. ([2]), in order to operate his measurement device properly (otherwise we may imagine that 
Bob starts by testing his device on many different states oj to identify a useful outcome io, and to obtain estimates of 
the corresponding parameters a and c). 

We will now show that the ballness of allows Bob to perform a bootstrapped strategy which establishes a spatial 
orthonormal coordinate frame. This allows him to determine the angle between any two possible orientations of his 
measurement device. For simplicity, we first discuss the case of dimension d = 2 here, and give the general protocol 
for d > 2 below. We dismiss the trivial case d = 1. 
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Bob starts by choosing two arbitrary preparation devices at random, preparing two unknown states u>i,u>2' Gener- 
ically, the corresponding Bloch vectors wi,o>2 will be linearly independent (if, for some reason, lo\ and u>2 turn out to 
be (close to) linearly dependent, the protocol will fail and Bob will have to start again). 

Now Bob determines M. x {^i) = c + (a/2)(uj x ,Cj\) for many different directions x by repeated measurements. He 
never knows which direction x he is currently actually measuring, but by trying out many different directions, he 
can determine a good estimate of va.a2t x M. x (uii) — c + (a/2)|a)i| and thus of and he may rotate his device in a 
direction which is very close to the actually maximizing direction x that satisfies lu x — t2>i/|a)i| (still, without knowing 
any coordinate description of x or Cj x ). 

Having done so, Bob leaves his device in direction x, and performs repeated measurements on u>2 to obtain an 
estimate of M x (uJ2) = c+ (a/2)(u} x , cl^), and thus of {Cj x ,Cj2)\Cj\\ = (lji 1 lj2)- Moreover, Bob can also estimate \lo2\ by 
repeating the strategy that he used to determine |c2>i|. This is all the information he needs to determine the coordinates 
of ui and 1I12 in some orthonormal coordinate system. For example, he may choose the coordinates such that lji — 
\lui\ ■ (1,0) T , and o>2 = \<2>2\ ■ (cos a;, sina;) T , where x must be chosen in accordance with (wi,^) = |wi| ■ I&2I • cosx. 

Now suppose Bob rotates his devices in some unknown direction y. By measuring M y (uJi) and A4 y (oL> 2 ) to good 
accuracy, he may determine the overlaps (u) y ,u)i) and (l) v ,u)2) and thus, since Uii,u>2 is a basis, the coordinates of Cj v 
in the given orthonormal frame. 

If Bob holds a second measurement device which points in another unknown direction z, he may do the same thing, 
and altogether compute the angle /-{Cj y , to z ) between the two direction's Bloch vectors. While there was some freedom 
to assign an orthonormal frame to establish coordinates for u)\ and L02, this angle is independent of the specific choice 
of frame. 

As shown in Lemma 22 in Appendix |Aj there exists some orthogonal transformation O € 0(d) such that lu x = Ox 



for all directions x. Thus, if Bob's space carries a metric, such that there is an actual physical angle Z(y, z) between the 
two devices' directions, we have Z(y, z) = Z(cj y , d> z ), and the angle that Bob determines by probability measurements 
must agree with the actual physical angle. 

Now we give a protocol which allows an observer to determine the angle between different direction bit measurement 
devices (or different settings of the same device), by means of probability measurements, in arbitrary dimensions d > 2. 
The protocol will yield more or less accurate estimates of the corresponding angle, depending on the statistical effort 
that the observer spends to obtain probability estimates. We assume that the observer knows the outcome io as 
defined in the main text, as well as the visibility and noise parameters a and c, and the spatial dimension d. 

Protocol 38. In d > 2 spatial dimensions, an observer (called Bob) can estimate the angle /.(y,z) between two given 
measurement devices M y and M. z (acting on systems according to Postulates 1 and 2) by the following protocol: 

1. Bob randomly selects d direction bit preparation devices that he finds in his lab (or in nature), preparing (un- 
known) direction bit states u>i, . . . , L)d- 

The protocol will assume that the corresponding Bloch vectors &i, . . . u)d are linearly independent, which is gener- 
ically the case. Otherwise, the protocol fails and has to be repeated. 

2. For every i = 1, ... ,d, Bob measures uJi in many different (unknown) directions x £ M. d , \x\ = 1. This way, 
he determines max x M x (uji) = c + (a/2)|ca,|, and he can rotate his device close to the (unknown) maximizing 
direction Xi where Cj Xi = (bi/\Cji\, setting the device up to perform the measurement M. Xi . 

3. By measuring the probabilities Ai Xi (ujj) = c + (a/2)((Di, (Dj) /\uJi\ for all j =/= i, he can determine the matrix 
Xij := (u)i, ujj) . 

4- Bob computes any matrix S that solves the equation S T S — X . A solution of this kind exists: any matrix S 
with columns a)^ , . . . , U)' d is a solution, if a>£ is the coordinate representation of UJi in any orthonormal basis. 
Conversely, it follows from the polar decomposition that every solution is of this form. 

Hence, in this step of the protocol, Bob obtains the coordinates of the C)\, . . . in some orthonormal basis. 

5. For any pair of measurement devices pointing in directions y and z, Bob can determine the coordinates of Cj y 
and Cj z in the previously obtained orthonormal basis by measuring M. y (uJi) — c+ (a/2)(uj y ,LUi) and A4 z (uJi) for 



i = 1, . . . , d, and therefore compute Z.(w x , uj y ). But according to Lemma 22, there is some orthogonal matrix O 
such that Gj x — Ox and Gj y = Oy, hence this angle equals Z(x, y). 

As announced in Section [Cj we now give a modification of the direction bit setup, showing that physical space can 
in some situations inherit its linear and Euclidean structure from state space. The following example is not meant to 
describe actual physics in our universe; it is simply a "proof of principle" demonstrating the mechanism under very 
specific conditions. 
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Example 39. Imagine an observer Bob in d- dimensional space, which is simply a topological manifold M . Bob's local 
laboratory is assumed to reside in a (small) part of this manifold, in the vicinity of some point p £ M. We assume 
that there are systems C (say, internal degrees of freedom of particles) described by a convex state space £Iq which is 
also d- dimensional, but not necessarily a Euclidean ball. 

We also assume that there is an analogue of a direction bit measurement device which can "point in different 
directions" and can be "rotated". However, since M does not carry a metric tensor, Bob's local laboratory space does 
not carry an inner product (there is not even the notion of a tangent space to begin with). Thus, there is no literal 
notion of direction vectors or rotations, and we have to define what we mean by these notions in a generalized sense. 

We do this by assuming that there is a special (small) open neighborhood U of p that is homeomorphic to a d- 
dimensional Euclidean ball, with a topological boundary dU homeomorphic to the (d — l)-sphere S^ -1 , such that for 
every x £ dU there is an effect £ x £ Eq which describes the first outcome of the "device pointing in direction x". 
Formally, we only assume that the measurement device can be in different macroscopic states indexed by x £ dU . The 
concrete physical interpretation will be left completely open, with the wording "pointing in direction x " chosen only to 
supply a more concrete mental picture. 

For simplicity, let us assume that we have a 2-outcome device, with outcomes labelled "yes" and "no", such that 
£x(w) yields the probability of outcome "yes" if the device "points in direction x" and is applied to the state u> £ flc- 
For obvious physical reasons, £ x should be continuous in x. A sketch is given in Fig. [9| We make an additional 
important assumption, namely that the effects determine the space points; that is, if x ^ y, then £ x =/= £ y - 



FIG. 9: The topological manifold M and a neighborhood U of a point p with a boundary that is homeomorphic to a (d — 1)- 
sphere. The different "directions" x £ dU in which a measurement device may be oriented are sketched as black arrows, 
illustrating the intuition that a measurement device "points" in the corresponding direction. However, the elements x £ dU 
are in general not vectors in any mathematically well-defined sense; they are only meant to label the different possible states 
of the macroscopic measurement device, leading to different types of measurements. 

The analog of a "rotation" is then any physical transformation which takes a measurement device pointing in 
direction x £ dU to point in some other direction y — H(x) £ dU . Which transformations with corresponding 
maps H are actually possible depends on the physics in Bob 's universe. To comply with some of our intuition on 
rotations, we only consider those transformations that are continuous and can be physically reversed by some inverse 
transformation of this kind. We will assume that the relevant physical quantities (measurement devices, particles etc.) 
are exactly as before if H and then is applied (however, there may be parts of the universe that have changed in 
this process; for example, a distant observer may have noticed the applications of H and then of H~ x and kept some 
memory of this). 

Since these transformations map dU continuously onto itself, and dU is homeomorphic to the (d— l)-sphere S 1 , 
we obtain a subgroup % of homeomorphisms of dU , which can also be seen as a subgroup of homeomorphisms of the 
unit sphere S' d_1 . So far, there is no reason why the transformations H £ H should act linearly; this notion does not 
even have any meaning at this point. We assume that these transformations allow Bob to collimate his device in any 
"direction" x £ dU that he likes; in other words, % acts transitively on dU . 

But now suppose that some of the transformations H £ T-L have impact on the measured outcome probabilities: 
the probability to see the first outcome may change if the measurement device is transformed via H . If it makes 
sense that Bob undergoes the transformation H together with the measurement device such that the device has not 
changed from his perspective ( that is, a "joint rotation"), he will model the net effect on the probabilities by some other 
transformation H' that acts on the state uj instead. It must satisfy the equation 



We will assume that this is possible. Due to the probabilistic interpretation of states, the map H' must be linear. 
Because of the assumed reversibility of H , it must be a reversible transformation, i.e. H' £ Qc- We do not yet know 






(CI) 
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whether this equation and linearity determine H' uniquely. Let uj G flc be any pure state, and define fi :— Jg G{ui) dG 
(which may well depend on the choice of uj). Then in particular H'fi = fi for all possible H' , since H' G Qc- Thus, if 
X,y € dU are arbitrary, there is some H G T-L such that H(x) = y, and 

£ y {n) = £ H ( x) (p) = £ x {H'ii) = £ x (p) =: m > 0. 

Thus, all £ x lie in the d-dimensional affine subspace A := {£ \ £(p) = m} of the (d + 1) -dimensional dual space C* . 
Let A' C A be the affine span of all £ x , and d! :— dim A' . Let h : dU — > S^ 1 be a homeomorphism and j : A' M. d 
an invertible affine map. Then the map s i— > j (£ h -i( s ^ is a continuous injective map from to R d . Due to 



Lemma 40 we must have d 1 > d = dim A, and so A' = A. Suppose that m = 0; in this case, relabel the two outcomes 



of the device "yes-^no" , such that the new £ x satisfy £ x (fi) — 1 for all x G dU . Thus, we may assume that m > 0, 



such that A is not a linear subspace. Consequently, the £ x linearly span C* , and so eq. ( CI) determines H 1 uniquely. 

If H does not alter the outcome probabilities, the corresponding map H' will be the identity map; in particular, 
H i— > H' need not be injective. Again, since the £ x span C* , we obtain 

£ x i— > £h(x) = £x ° H ' =: Lh{£ x ) extends to a linear invertible map Lh from the dual space C* to itself. 

Let H' be the topological closure of the group of all H' , where H G %. Since it is a subset of the compact group 
of reversible transformations of C , it must itself be compact. Let t/i, ... , Vd+i € dU be any set of points such that 
£ yi , . . . , £ yd+1 is a basis of C* . Define the subspace S of C* by 

S:={feC*\ = 0}. 

Then the functionals £ Vi —mU c span S , and we can find d of them which constitute a basis of S . Call these functionals 
J 7 !, . . . , Td (in some arbitrary order). Now we can define a coordinate map A : S 



I:. ( / 



via 



£ = Ai T x + . . . + X d T d «■ A(£) = (A 1; . . . , A d ) T . 

This allows us to define coordinates X(x) of space points x G dU via 

X(x) := A(£ x -mU c ). 

What is the action of H in these coordinates? Since Lh{U c ) = U c ', we have 

A (H{x)) = A{£ H{x) - mU c ) = A o L H {£ X - mU c ) = A o L H o A" 1 X(x). (C2) 

In other words, we have constructed a fictitious d-dimensional linear space such that all x € dU can be represented as 
elements of this linear space, and all transformations H £ T-L act linearly (represented by ho Lh o hr 1 ). This vector 
space structure is inherited from the convexity of probabilities. 

Define the group L as the topological closure of {A o Lh o A -1 | H G H}. Due to Lh{£) — £ ° H' , compactness of 
H' implies compactness of C Thus, there is an inner product on M. d such that (v, w ) = (Lv, Lw) for all v, w G M. d and 
L G C. With respect to this inner product, £ is a subgroup of SO(d). Hence eq. | C2) implies that \\X (H(x)) \\ = ||A(x)|| 
for the corresponding norm, and since H is transitive on dU , we obtain that 

there exists r > such that ||A(x)|| = r for all x G dU. 

Let h : dU — > S^ 1 be a homeo mo rphism, then (1/r) ■ X o h^ 1 is a continuous injective map from the sphere S^ 1 



into itself. According to Lemma 4-1 it must be surjective - in other words, the set {X(x) \ x G dU} is the full sphere 



of radius r in Mr. Since C is transitive on this sphere, we see that C acts irreducibly on Mr; hence the inner product 
(•, •) is in fact unique. 

In other words, we have obtained a unique Euclidean structure on our vector space representation of dU, inherited 
from the group of reversible transformations on state space. 

We can say more about the set of states flc- Let us introduce coordinates on the (d+ 1) -dimensional space C* : for 
£ G C*, set 

£:= (£(fi),A(£-£(fi)U c )) eR d+1 . 

Basically, this extends the coordinate map A from the subspace S to all of C* . In particular, we have U c — (1, 0, . . . , 0) . 
For H G %, set H := 1® KoLh oA _1 , where 1 acts as the identity on the first entry of the vector to which it is applied. 
We introduce an inner product on M. d+1 : ifa,b£R andv,wGM. d , set ((a,v), (b,w)) := ab + (v , w) , where the inner 



34 



product on the right-hand side is the C-invariant inner product constructed above. It follows that (Hx,Hy) = (x,y) 
for all x, y £ ]R d+1 . Now we can represent elements ui £ C (for example states) by vectors oil £ K d+1 7 defined by the 
equation £(oj) — (£,&). For u> £ ilc, we have 1 = U (uS) = (U c ,u>) — u>i, where Q\ is the first component of the 
vector ui. Thus, we can represent every u> £ flc via Co = (l,ui). We get 

{£,H- l Q) = (HE,Q) = {{£{»), hL H {£ - £{n)U c )),u) = ((£(»), A((£ - £ ([i)U c ) o H')),u) = (£^W,oj) 
= {£ o H')(u) = £{H'{u)) = (£, H'(u)). 

Thus, we obtain H'(uS) — Ii~ x Cb. In summary, for two states ip,ui £ Qc, we have 

if = H'{uj) ^ if — H~ x Co if = A o L H -i o A _1 o). 

In other words, TL acts on the subspace containing the Co as the group C; since C is transitive on the unit sphere, 
this implies that the set of "Bloch vectors" {i2> \ uj £ flc} is a Euclidean ball (of some radius), and equivalent to a 
d-dimensional Euclidean unit ball, exactly as direction bits are. Thus, Bob may use Protocol \38\ to determine angles 
between different orientations of his measurement device. In retrospect, we also see that the maximally mixed state /j, 
is unique (it is the center of the ball); hence the linear structure that we have constructed is unique as well. 



The following lemmas have been used in Example 39 



Lemma 40. There is no continuous injective map f : S — > K™ if n < d. 

Proof. It is sufficient to consider the case n = d— 1; otherwise, we can compose / with an embedding of 1" into 
The result for n = d — 1 follows from an application of the invariance of domain theorem |71l Exercise 7.6]. □ 

Lemma 41. Every continuous injective map from the sphere S^ 1 to itself is surjective. 

Proof. Suppose there was a non-surjective continuous injective map / from S ,d_1 to itself. Let s £ S^ 1 be any point 
which is not attained by /; then / can be interpreted as a continuous injective map from S^ -1 to the punctured sphere 
S^ 1 \ {s}, which is well-known [53] to be homeomorphic to Let g : S^ 1 \ {s} — > be a corresponding 

homcomorphism, then g o f is a continuous injective map from S ,rf_1 to contradicting Lemma 40 □ 



Appendix D: Simplified proof of the result of Section 4.3 in Ref. [54j for SO(d) 

In |54j . it has been shown that two noiseless d-dimensional Euclidean ball state spaces can be combined into an 
interacting, joint state space if and only if d = 3. In that paper, we have considered the general case of ball state 
spaces with any compact group of reversible transformations which is transitive on the unit sphere. However, here, 
we are only interested in the special case that the group of reversible transformations on a direction bit contains the 
full orthogonal group SO(d), as established in Lemma 18 It turns out that this simplifies the proof of a key lemma 
in [54] significantly. 

Here, we give the simplified proof, as a reference for readers who would like to follow the argumentation in |54) . 
Therefore, we do not introduce the relevant notation here in the appendix, but refer the reader to the introductory 
chapters of |54j . and just use the notation that has been introduced there. 

Lemma 42 (Section 4.3 in [5]E], special case of SO(d), d > 2). If a generator W £ g is of the block- diagonal form 



W 





Y 

0X0 

z 



then Z = X 



•Y , i.e. W generates non-interacting dynamics. 



Proof. Since W is antisymmetric, so are X and Y, which are thus generators of rotations. By assumption, we can 
perform the rotations exp(tX) ® exp(<y) on the joint system, which are generated by 



w 





Y 

X 

o o o a: 



o 




1 + 1 



Y 
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Since g is a Lie algebra, it also contains the element 

W" := W - W = diag(0, 0, 0, V), where V := X <g> 1 + 1 ®Y - Z is antisymmetric. 
Applying constraint (35) from [53] in the special case x = —a implies 
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for all a, b 6 E d with |a| = |b| = 1, which simplifies to (a <g> b) • NV 2 N 1 (a(g) b) > 0. Summing over all a and all 
b in an orthonormal basis yields < tv{NV 2 N^ 1 ) = tv{V 2 ). But V T V > 0, and since V T = -V, we have V 2 < 0, 
hence ti(V 2 ) < 0. Both inequalities together give tr(V 2 ) = 0, which is only possible if V 2 = 0, and thus V = 0. □ 



